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FIG. 7 



1 CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG 

51 GTAGTTTAGA AATAAAATAT TATTCCCGCA CAT^TGGAGA TGGATATTGA 

101 TTTGGATGAT ATAGAAAATT TACTTCCTAA TACATTCAAC AAGTATAGCA 

151 GCTCTTGTAG TGACAAGAAA GGATGCAAAA CATTGAAATC TGGCTCGAAA 

2 01 TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG AGTTCTACTT 

2 51 CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA 

3 01 AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA 

3 51 AATCAGGTAA TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA 

4 01 TTATGGAGAA AATTACTTAA TACTAAAAGG TAAACAGTTT GGATTATTTC 

4 51 CCTAGCCAAC AATGATGAGT ATATTAAATT CATATGAGAA TGAGTCAAAG 

5 01 GATCTCGATA CATCAGACTT ACCAAAGACA AACTCGCTAT AAAACGCAAG 

5 51 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 

6 01 TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC 
651 TTGAGACAAT TGAAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA 

7 01 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT 
751 CTCAATTTAA TGGATAGCTA TAGAAACAAA CCAAATAAAC CATGCAAGTT 

8 01 TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC TGAATTTATA 
8 51 TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC 
901 TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT 
951 AAAAGAAGCA GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT 

10 01 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA 

1051 AAAGAAGAAG AGCTATCACA ATCCTGATTC TTAAAGATTT CAAAAATTCC 

1101 AGGTAAGAGA GATACATTCA TTAAAATTCA TATATTATAG TTTTTCATTT 

1151 CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 

12 01 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT 

12 51 TCACATTCAT AGATCGACCT TCATATATCC AATACGATGA TAAGGAAACA 

13 01 GCAGTCATCC GTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC 

13 51 AAGAAATGGA GCCGAAATCT TAATCAAAAA GAATTGCGTC GATATTGCAA 

14 01 AAGAATCGAA CTCTAAATCT TTCGTTAATA AGTATTACCA ATCTTGATTG 

14 51 ATTGAAGAGA TTGACGAGGC AACTGCACAG AAGATCATTA AAGAAATAAA 

15 01 GTAACTTTTA TTAATTAGAG AATAAACTAA ATTACTAATA TAGAGATCAG 

15 51 CGATCTTCAA TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA 

16 01 ATACAAACCT TGGTCAAAAT ATTGAGGAAG GATUVAGAAGA CCAGTTAGCA 
1651 AAAGAAAAAA TAAGGCAATA AATAAAATGA GTACAGAAGT GAAGAAATAA 
1701 AAGATTTATT TTTTTCAATA ATTTATTGAA AAGAGGGGTT TTGGGGTTTT 
1751 GGGGTTTTGG GG 
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1 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 
2101 
2151 
2201 
2251 
2301 
2351 



AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT 
AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA- 
ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC 
TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGGA 
TCCAGAAAGT TATTAGATGA AGAAATCAAT CTCAAAGTCA TTATAAAGAT 
TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTCCACG 
AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT 
CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT 
TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA 
ACTTAAGGGA AATCAATTAG CAAAGACCCA TTTATTAACA GCTCTTTCAA 
CTCAAAAGCA GTATTTCTTT CAAGACGAAT GGAACCAAGT TAGAGCAATG 
ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCCA 
GCGAACTTCT GAAGGAACTC TTGTTCAATT TTGCGGGAAT AACGTTTTTG 
ATCATTTGAA AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA 
GCAGACATGA ATGAACCTCG ATGTTGATCA ACCTGCAAAT ACAATGTCAA 
GAATGAGAAA GATCACTTTC TCAACAACAT CAACGTGCCG AATTGGAATA 
ATATGAAATC AAGAACCAGA ATATTTTATT GCACTCATTT TAATAGAAAT 
AACCAATTCT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 
AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA 
GAATTAGAAA GAAGCTAAAA GATAAGGTTA TCGAAAAAAT TGCCTACATG 
CTTGAGAAAG TCAAAGATTT TAACTTCAAC TACTATTTAA CAAAATCTTG 
TCCTCTTCCA GAAAATTGGC GGGAACGGAA ACAAAAAATC GAAAACTTGA 
TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA GCTGTTTAGC 
TACACAACTG ATAATAAATG CGTCACACAA TTTATTAATG AATTTTTCTA 
CAATATACTC CCCAAAGACT TTTTGACTGG AAGAAACCGT AAGAATTTTC 
AAAAGAAAGT TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC 
AAAAACTTAT TGCTTGAGAA GATCAATACA AGAGAAATAT CATGGATGCA 
GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGATCAC GAAAACATCT 
ACGTCTTATG GAAATTGCTC CGATGGATAT TCGAGGATCT CGTCGTCTCG 
CTGATTAGAT GATTTTTCTA TGTCACCGAG CAACAGAAAA GTTACTCCAA 
AACCTATTAC TACAGAAAGA ATATTTGGGA CGTCATTATG AAAATGTCAA 
TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT 
GAAGAATGGA AAAAGTCGCT TGGATTTGCA CCTGGAAAAC TCAGACTAAT 
ACCGAAGAAA ACTACTTTCC GTCCAATTAT GACTTTCAAT AAGAAGATTG 
TAAATTCAGA CCGGAAGACT ACAAAATTAA CTACAAATAC GAAGTTATTG 
AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGATCC 
TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG 
AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA 
ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACTATC 
AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA 
CTGCACAAAT TCTAAAGAGA AAGAATAACA TAGTTATCGA TTCGAAAAAC 
TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT 
TGCACTTGAA GGAGGACAAT ATCCAACCTT ATTCAGTGTT CTTGAAAATG 
AACAAAATGA CTTAAATGCA AAGAAAACAT TAATTGTTGA AGCAAAGCAA 
AGAAATTATT TTAAGAAAGA TAACTTACTT CAACCAGTCA TTAATATTTG 
CCAATATAAT TACATTAACT TTAATGGGAA GTTTTATAAA CAAACAAAAG 
GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT 
GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA 
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24 01 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC 

24 51 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT 

25 01 ATAAACGTAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA 

25 51 GACTAGTTTT CCATTAAGTC CAAGCAAATT TGCAAAATAC GGAATGGATA 

26 01 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC 

26 51 TCAATTGATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT 

27 01 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAGAAAGCAT 
2751 CAATGTGGCT CAAGAAGAAA CTAAAGTCGT TTTTAATGAA TAACATTACC 
2 8 01 CATTATTTTA GAAAGACGAT TACAACCGAA GACTTTGCGA ATAAAACTCT 
2 8 51 CAACAAGTTA TTTATATCAG GCGGTTACAA ATACATGCAA TGAGCCAAAG 
2 901 AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG TATGATCGAC 

2 951 TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGCAT TCTTTAAATA 

3 0 01 CCTTGTGTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CATTATCCAG 
3 051 ACTTTTTCCT TAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA 
3101 AAGTACATTT TCAACAGAGT TTGCATGATC CTCAAGGCAA AAGAAGCAAA 
3151 GCTAAAAAGT GACCAATGTC AATCTCTAAT TCAATATGAT GCATAGTCGA 
3 2 01 CTATTCTAAC TTATTTTGGA AAGTTAATTT TCAATTTTTG TCTTATATAC 
3 251 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG 
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1 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 



MEVDVDNQAD 
LEDIKIFAQT 
SSSDVSDRQK 
IGNELFRHLY 
ADMNEPRCCS 
NQFFKKHEFV 
LEKVKDFNFN 
YTTDNKCVTQ 
KNLLLEKINT 
LIRCFFYVTE 
EEWKKSLGFA 
NSHLMLKTLK 
TMDIEKCYDS 
FRKKEMKDYF 
RNYFKKDNLL 
ATLEESSLGF 
INVSRENGFK 
SIDMKTLALM 
HYFRKTITTE 
LEVSKIIYSV 
KYIFNRVCMI 



NHGIHSALKT 
NIVATPRDYN 
LQCFGFQLKG 
TKYLIFQRTS 
TCKYNVKNEK 
SNKNNISAMD 
YYLTKSCPLP 
FINEFFYNIL 
REISWMQVET 
QQKSYSKTYY 
PGKLRLIPKK 
NRMFKDPFGF 
VNREKLSTFL 
RQKFQKIALE 
QPVINICQYN 
LRDESMNPEN 
FNMKKLQTSF 
PNINLRIEGI 
DFANKTLNKL 
TRAFFKYLVC 
LKAKEAKLKS 



CEEIKEAKTL 
EEDFKVIARK 
NQLAKTHLLT 
EGTLVQFCGN 
DHFLNNINVP 
RAQTIFTNIF 
ENWRERKQKI 
PKDFLTGRNR 
SAKHFYYFDH 
YRKNIWDVIM 
TTFRPIMTFN 
AVFNYDDVMK 
KTTKLLSSDF 
GGQYPTLFSV 
YINFNGKFYK 
PNVNLLMRLT 
PLSPSKFAKY 
LCTLNLNMQT 
FISGGYKYMQ 
NIKDTIFGEE 
DQCQSLIQYD 



YSWIQKVIRC 
EVFSTGLMIE 
ALSTQKQYFF 
NVFDHLKVND 
NWNNMKSRTR 
RFNRIRKKLK 
ENIilNKTREE 
KNFQKKVKKY 
ENIYVLWKLL 
KMSIADLKKE 
KKIVNSDRKT 
KYEEFVCKWK 
WIMTAQILKR 
LENEQNDLNA 
QTKGIPQGLC 
DDYLLITTQE 
GMDSVEEQNI 
KKASMWLKKK 
CAKEYKDHFK 
HYPDFFLSTL 
A 



RNQSQSHYKD 
LIDKCLVELL 
QDEWNQVRAM 
KFDKKQKGGA 
IFYCTHFNRN 
DKVIEKIAYM 
KSKYYEELFS 
VEIiNKHELIH 
RWIFEDLWS 
TLAEVQEKEV 
TKLTTNTKLL 
QVGQPKLFFA 
KNNIVIDSKN 
KKTLIVEAKQ 
VSSILSSFYY 
NNAVLFIEKL 
VQDYCDWIGI 
LKSFLMNNIT 
KNLAMSSMID 
KHFIEIFSTK 
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CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA 

1 + + + + + + 60 

GGGGTTTTGGGGTTTTGGGGTTTTGGGGATATTTTTTTCTTTTTTAACTCCATCAAATCT 

a PQNPKTPKPL*KKKKLR*FR 

b PKTPKPQNPYKKRKNCGSLE- 

c PKPQNPKTPIKKEKIEVV*K- 

AATAAAATATTATTCCCGCACAAATGGAGATGGATATTGATTTGGATGATATAGAAAATT 

61 + + + + + + 120 

TTATTTTATAATAAGGGCGTGTTTACCTCTACCTATAACTAAACCTACTATATCTTTTAA 

a NKILFPHKWRWILIWMI*KI 

b IKYYSRTNGDGYCFGCYRKF- 

c *NIIPAQMEMDIDLDDIENL- 

TACTTCCTAATACATTCAACAAGTATAGCAGCTCTTGTAGTGACAAGAAAGGATGCAAAA 

121 + + + + + + 180 

ATGAAGGATTATGTAAGTTGTTCATATCGTCGAGAACATCACTGTTCTTTCCTACGTTTT 

a YFLIHSTSIAALVVTRKDAK 

b TS*YIQQV*QLL**QERMQN- 

C LPNTFNKYSSSCSDKKGCKT- 

CATTGAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAAACAATTAG 

181 + + + + + + 240 

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC 

a HCNLARNRLHCLFQSCKNN* 

b lEIWLEIAFI DYSKVAKTIR- 

C LKSGSKSPSLTIPKLQKQLE- 

AGTTCTACTTCTCGGATGCAAATCTTTATAACGATTCTTTCTTGAGAAAATTAGTTTTAA 

241 + + + + + + 300 

TCAAGATGAAGAGCCTACGTTTAGAAATATTGCTAAGAAAGAACTCTTTTAATCAAAATT 

a SSTSRMQIFITILSCEN*F* 

b VLLLGCKSL*RFFLEKISFK- 

C FYFSDANLYNDSFLRKLVL K- 

AAAGCGGAGAGCAAAGAGTAGAAATTGAAACATTACTAATGTTTAAATAAAATCAGGTAA 

301 + + + + + + 360 

TTTCGCCTCTCGTTTCTCATCTTTAACTTTGTAATGATTACAAATTTATTTTAGTCCATT 

a KAESKE*KLKHY*CLNKIR* 

b KRRAKSRNCNITNV*IKSGN- 

c SGEQRVEIETLLMFK*NQVM- 

TGAGGATTATTCTATTTTTTAGATCACTTCTTAAGGAGCATTATGGAGAAAATTACTTAA 

361 + + + + + + 420 

ACTCCTAATAAGATAAAAAATCTAGTGAAGAATTCCTCGTAATACCTCTTTTAATGAATT 

a CGLFYFLDHFLRSIMEKIT* 

b EDYSIF*ITS*GALWRKLLN- 

C RIILFFRSLLKEHYGENYLI- 

FIG. 12A 
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TACTAAAAGGTAAACAGTTTGGATTATTTCCCTAGCCAACAATGATGAGTATATTAAATT 

421 + + + + + + 480 

ATGATTTTCCATTTGTCAAACCTAATAAAGGGATCGGTTGTTACTACTCATATAATTTAA 

a Y*KVNSLDYFPSQQCCVY*I 

b TKR*TVWIISLANNDEYIKF- 

C LKGKQFGLFP*PTMMSILNS- 

CATATGAGAATGAGTCAAAGGATCTCGATACATCAGACTTACCAAAGACAAACTCGCTAT 

481 + + + + + + 540 

GTATACTCTTACTCAGTTTCCTAGAGCTATGTAGTCTGAATGGTTTCTGTTTGAGCGATA 

a HMRMSQRISIHQTYQRQTRY 

b ICECVKGSRYIRLTKDKLAI- 

C YENESKDLDTSDLPKTNSL*- 

AAAACGCAAGAAAAAGTTTGATAATCGAACAGCAGAAGAACTTATTGCATTTACTATTCG 

541 + + + + 4- + 600 

TTTTGCGTTCTTTTTCAAACTATTAGCTTGTCGTCTTCTTGAATAACGTAAATGATAAGC 

a KTQEKVC*SNSRRTYCIYYS 

b KRKKKFDNRTAEELIAFTIR- 

C NARKSLIIEQQKNLLHLLFV- 

TATGGGTTTTATTACAATTGTTTTAGGTATCGACGGTGAACTCCCGAGTCTTGAGACAAT 

601 — + + + + + + 660 

ATACCCAAAATAATGTTAACAAAATCCATAGCTGCCACTTGAGGGCTCAGAACTCTGTTA 

a YGFYYNCFRYRRCTPESCDN 

b MGFITIVLGIDGELPSLETI- 

c WVLLQLF*VSTVNSRVLRQL- 

TGAAAAAGCTGTTTACAACTGAAGGAATCGCAGTTCTGAAAGTTCTGATGTGTATGCCAT 

661 + + + + + + 720 

ACTTTTTCGACAAATGTTGACTTCCTTAGCGTCAAGACTTTCAAGACTACACATACGGTA 

a CKSCLQLKESQFCKF*CVCH 

b EKAVYN*RNRSSESSDVYAI- 

C KKLFTTEGIAVLKVLMCMPL- 

TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCTATAGAAACAAA 

721 + + + + + + 780 

ATAAAACACTTAATTAGAGTTTATAGAATAGAGTTAAATTACCTATCGATATCTTTGTTT 

a YFVN*SQISYLNLMDSYRNK 

b IL*INLKYLISI*WIAIETN- 

C FCELISNILSQFNG*L*KQT- 

CCAAATAAACCATGCAAGTTTAATGGAATATACGTTAAATCCTTTGGGACAAATGCACAC 

781 + + + + + + 840 

GGTTTATTTGGTACGTTCAAATTACCTTATATGCAATTTAGGAAACCCTGTTTACGTGTG 

a PNKPCKFNGIYVKSFGTNAH 

b QINHASLMEYTLNPLGQMHT- 

C K*TMQV*WNIR*ILWDKCTL- 

TGAATTTATATTGGATTCTTAAAGCATAGATACACAGAATGCTTTAGAGACTGATTTAGC 

841 + + + + + + 900 

ACTTAAATATAACGTAAGAATTTCGTATCTATGTGTCTTACGAAATCTCTGACTAAATCG 

a CIYIGFLKHRYTECFRDCFS 

b EFILDS*SIDTQNALETDLA- 

c NLYWILKA*IHRML*RLI*L- 
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TTACAACAGATTACCTGTTTTGATTACTCTTGCTCATCTCTTATATCTTTAAAAGAAGCA 

901 + + + + + + 960 

AATGTTGTCTAATGGACAAAACTAATGAGAACGAGTAGAGAATATAGAAATTTTCTTCGT 

a LQQITCFDYSCSSLISLKEA 

b YNRLPVLITLAHLLYL*KKQ- 

C TTDYLF*LLLLISYIFKRSR- 

GGCGAAATGAAAAGAAGACTAAAGAAAGAGATTTCAAAATTTGTTGATTCTTCTGTAACC 
961 + ^ + + + + 1Q2Q 

CCGCTTTACTTTTCTTCTGATTTCTTTCTCTAAAGTTTTAAACAACTAAGAAGACATTGG 

a GEMKRRLKKEISKFVDSSVT 

b AK*KED*RKRFQNLLILL*P- 

C RNEKKTKERDFKIC*FFCNR- 

GGAATTAACAACAAGAATATTAGCAACGAAAAAGAAGAAGAGCTATCACAATCCTGATTC 

1021 + + + + + + 1080 

CCTTAATTGTTGTTCTTATAATCGTTGCTTTTTCTTCTTCTCGATAGTGTTAGGACTAAG 

a GINNKNISNEKEEELSQS*F 

b ELTTRILATKKKKSYHNPDS- 

c N*QQEY*QRKRRRAITILIL- 

TTAAAGATTTCAAAAATTCCAGGTAAGAGAGATACATTCATTAAAATTCATATATTATAG 

1081 + + + + + + 1140 

AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATGTAAGTAATTTTAAGTATATAATATC 

a liKISKIPGKRDTFIKIHIL* 

b *RPQKFQVREIHSLKFIYYS- 

C KDFKNSR*ERYIH*NSYIIV- 

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAATATTTTTTGATTAGCTGGAA 

1141 + + + + + 1200 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 

a FFISQLLFSFILTIFFD*LE 

b FSFHSCYFLLS*QYFLISWK- 

C FHFTAVIFFYLNNIF*LAGS- 

GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 

1201 + + + + + + 1260 

CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATTGAATCGAATAAGTGTAAGTA 

a VKSIK*EKR*TEVT*LIHIH 

b *KVSNKRSARLR*LSLFTFI- 

C KKYQIREALD*GNLAYSHS*- 

AGATCGACCTTCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 

1261 + + + + + + 1320 

TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGTCAGTAGGCAAAATTTTT 

a RSTFIYPIR**GNSSHPF*K 

b DRPSYIQYDDKETAVIRFKN- 

C IDLHISNTMIRKQQSSVLKI- 

TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAGCCGAAATCTTAATCAAAAA 

1321 + + + + + + 1380 

ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTTTAGAATTAGTTTTT 

a *CYED*IFRVKKWSRNLNQK 

b SAMRTKFLESRNGAEILIKK- 

C VL*GLNF*SQEMEPKS*SKR- 
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GAATTGCGTCGATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA 

1381 + + + + + + 1440 

CTTAACGCAGCTATAACGTTTTCTTAGCTTGAGATTTAGAAAGCAATTATTCATAATGGT 

a ELRRYCKRIEL*IFR**VLP 

b NCVDIAKESNSKSFVNKYYQ- 

c lASILQKNRTLNLSLISITN- 

ATCTTGATTGATTGAAGAGATTGACGAGGCAACTGCACAGAAGATCATTAAAGAAATAAA 
1441 + + + + + + 1500 

TAGAACTAACTAACTTCTCTAACTGCTCCGTTGACGTGTCTTCTAGTAATTTCTTTATTT 

a ILIDCRD*RGNCTEDH*RNK 

b SCLIEEIDEATAQKIIKEIK- 

C LDCLKRLTR QLHRRSLKK*S- 

GTAACTTTTATTAATTAGAGAATAAACTAAATTACTAATATAGAGATCAGCGATCTTCAA 

1501 + + + + + -f 1560 

CATTGAAAATAATTAATCTCTTATTTGATTTAATGATTATATCTCTAGTCGCTAGAAGTT 

a VTFIN*RIN*ITNIEISDLQ 

b *LLLIRE*TKLLI*RSAIFN- 

C NFY*LENKLNY*YRDQRSSI- 

TTGACGAAATAAAAGCTGAACTAAAGTTAGACAATAAAAAATACAAACCTTGGTCAAAAT 

1561 + + + + + + 1620 

AACTGCTTTATTTTCGACTTGATTTCAATCTGTTATTTTTTATGTTTGGAACCAGTTTTA 

a LTK*KLN*S*TIKNTNLGQN 

b CRNKSCTKVRQ*KIQTLVKI- 

C DEIKAELKLDNKKYKPWSKY- 

ATTGAGGAAGGAAAAGAAGACCAGTTAGCAAAAGAAAAAATAAGGCAATAAATAAAATGA 

1621 + + + + + + 1680 

TAACTCCTTCCTTTTCTTCTGGTCAATCGTTTTCTTTTTTATTCCGTTATTTATTTTACT 

a IEEGKEDQLAKEKIRQ*IKC 

b LRKEKKTS*QKKK*GNK*NE- 

C C G R K R R_ P VSKRKNK AINKMS- 

GTACAGAAGTGAAGAAATAAAAGATTTATTTTTTTCAATAATTTATTGAAAAGAGGGGTT 

1681 + + + + + + 1740 

CATGTCTTCACTTCTTTATTTTCTAAATAAAAAAAGTTATTAAATAACTTTTCTCCCCAA 

a VQKCRNKRFIFFNNLLKRGV 

b YRSEEIKDLFFSIIYCKEGF- 

C TEVKK *KIYFFQ*FIEKRGF- 

TTGGGGTTTTGGGGTTTTGGGG 

1741 + + — 1762 

AACCCCAAAACCCCAAAACCCC 

a LGFWGFG 
b WGFGVLG 
c GVLGFW 
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2 EVDVQNQADNHGIHSALKTCEEIKEAKTLYSWIQKVIRCRNQSQSHYKDL 51 
|::: | :.::|: :| |.::| ::. . | |..|.|.| 

19 ELELEMQENQNDIQVRVK. . . . IDDPKQY. .LVNVTAACLLQEGSYYQDK 62 

52 EDIKIFAQTNIVATPRDYNEEDFKVIARKEVF.STGLMIELIDKCLVELL 100 

: .|.| ..|.. .|: ..:| | ...::|.: 
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108 CWHKNTQPFIEKYFNBCAVLLPNDLLEVCEFAQVLYI 144 

151 IGNELFRHLYTKYLIFQRTSEGTLVQFCGNNVFDHLKVNDKFDKKQKGGA 2 00 

I :|| | ..|..,. ::: :: 

14 5 FDATEFKNLY LDRILSQDIRKELTFRKCLQRCVRSKF 181 

201 ADMNE. . .PRCCSTCKYNVKJS[EKX)HFLmiNVPNWNNMKSRTRIFyCTHF 247 

182 SEFNEYQLGKYCTES . . QRKKTMFRYLSVTNKQKWDQTKKK 220 

24 8 NRNNQFFKKHEFVSNKNNISAMDRAQTIFTNIFRFNRIRKKLKDKVIEKI 2 97 

|.: ...| :. I : :|:..: i ||. |:-M 

221 .RKENLLTKLQAIKESEDKSKRETG DIMNVEDAIKALKPAVMKKI 264 

2 98 AYMLEKVKDFNFNYYLTKSCPLPENWRERKQKIENLINKTREEKSKYYEE 347 

I . . :| :. I ..: |.| |.| :.: 
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2 95 LIKFCHISEP KERVYKILGKKYPKTEEEYKAAFGDSASAPFN . PE 33 8 

3 98 LIHKNLLLEKINTREISWMQVETSAKHFYYFDHENIYVLWKLLRWIFEDL 44 7 

I |.. :| .|:| .: . :..|. : : .| ... :|| : : 
339 LAGKRMKIEISKTWENELSAKGNTAEVWDNLISSNQLPYMAMLRNLSN. . 386 

44 8 WSLIRCFFYVTEQQKSYSKTYYYRKNIWDVIMKMSIADLKKETLAEVQE 4 97 

hi 

3 87 ILKAGVSD 3 94 

49 8 KEVEEWKKSLGFAPGKLRLIPKKTTFRPIMTFNKKIVNSDRKTTKLTTNT 547 

395 TTHS 398 

54 8 KLLNSHLMLKTLKNRMFKDPFGFAVFNYDDVMKKYEEFVCKWKQVGQPKL 597 

399 IVINK ICEPKAVENSKM 415 

59 8 FFATMDIEKCYDSVNREKLSTFLKTTKLLSSDFWIMTAQILKRKNNIVID 64 7 

I :. :.|| I :|,.| .: :.:|| |: |:: 

416 F. . PLQFFSAIEAVN.EAVTKGFKAKK. . . RENMNLKGQIEAVKE . .WE 457 

64 8 SKNFRKKEMKDYFRQKFQKIALEGGQYPTLFSVLENEQNDLNAKKTLIVE 697 
..: ||:| .||..: .,:..| |. .| 

45 8 KTDEEKKDM ELEQTEEGEFVKVNEGIGKQYINSIELAIK 4 96 

69 8 AKQRNYFKKDNLLQPVINICQYNYINFNGKFYKQTKGIPQGLCVSSILSS 74 7 

4 97 lAVNKNLDEIKGHTAIFSDVSGSMSTSMSGGAKKYGSVRTCLECALVLGL 54 6 
74 8 FYYATLEESSLGFLRDESMNPENPNVNLLMRLTDDYLLITTQENNAVLFI 797 
54 7 MVKQRCE KS S F Y I F S S PS SQCNKC YLE VDL 576 
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7 98 EKLINVSRENGFKFNMKK.LQTSFPLSPSKFAKYGMDSVEEQNIVQDYCD 846 

-l-l II-- -|:.: --I I 

577 PGDELRPSMQKLLQEKGKLGGG . . TDFPYECIDEWTKNKTHVD 617 

84 7 WIGI S IDMKTLALMPNINLRIEGILCTLNLNMQTKKASMWLKKKLKSFLM 8 96 

|.| II.. . .:||:| .:|: ||.|. : 

618 NIVILSDMMIAEGYSDINVRGSSIVNSI KKYKDEVN 653 

8 97 NNITHYFRKTITTEDFANKTLNKLFISGGYKYMQCAKEYKD.HFKKNLAM 94 5 

||. : .:. |::: |::. :.|:.: :: | ::| 

654 PNIKIF. . . AVDLEGYG KCLNLGDEFNENNYIKIFGM 687 

94 6 SSMIDLEVSKIIYSVTRAFFKYLVCNIKDTIFGEEHYPDFFLSTLKHFIE 9 95 

|. I :|:: ... :. :::| 
688 SDSI LKFISAKQGGA NMVE 706 

996 IFSTKKYIFNRVC 1008 

707 VI . . KNFALQKIG 717 
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132 LSTQKQYFFQDEWNQVHAMIGNEL.FRHLYTKYLIFQRTSE . , GTLVQFC 178 

H ----Mill : | | | . 

1 MSRRNQ KKPQAPIGNETNLDFVLQNLEVYKSQIEHYKTQQQQI 43 

17 9 GNNVFDHLKVNDKFDKKQKGGAADMNEPRCCSTCKYNVKNEKDHFLNNIN 228 
:: :. ||..:. :. --|...| :| | .|..:.:|..:| 

44 KEEDLKLLKFKNQDQDGNSGNDDDDEE NNSNKQQELLRRVN 84 

22 9 VPNWNNMKSRTRIFYCTHFNRNNQFFKKHEFVSNKNNISAMDRAQTIF™ 2 78 

|::|| |:.| : . .| 

8 5 QIKQQVQLIKK. . . VGSKVEKDLNLNEDENKKN 114 

2 7 9 I FRFNRIRKKLKDKVI EKIAYMLEKVKDFNFNYYLTKSCPLPENWRERKQ 32 8 

- I- • : I . -II • 

115 GLSEQQVKEEQLRTITEEQVKYQNLVFNMDYQLDLNESGGHRRHRRETDY 164 

329 KIENLINKTREEKSKYYEELFSYTTDNKCVTQFINE . FFYNILPKDFLTG 377 

..|.::: .:::| ' .:|. : |: | : ||::. 

165 DTEKWFEISHDQK NYVSIYANQKTSYCWWLKDYFNK 200 

3 78 RNRKNFQKKVKKYVELNKHELIHKNLLLEKINTREISWMQVETSAKHFYY 427 

• I .. |:. : : :.|. : \ : \ : . . 

201 NNYDHLNVSINRLE . . TEAEFYAFDDFSQTIKLTNNSYQTVNID 242 

428 FDHENIYVLWKLLRWI . . FEDLWSLIRCFFYVTEQQKSYSKTYYYRKNI 475 

.: :| |||:: :| : : ||: : | .| .:.|. . ,.| 

243 VNFDNNLCILALLRFLLSLERFNILNIRSSY. . TRNQYNFEKIGELLETI 2 90 

476 WDVIMKMSIADLKKETLAEVQEKEVEEWKKSLGFAPGKLRLIPKKTTFRP 525 

2 91 FAWFSHR HLQGIHLQVPCEAFQYLVNSSSQISVKDSQLQ 33 0 

52 6 IMTFNKKI VNSDRKTTKLTTNTKLLNSHLMLKTLKNRMFKDPFGFAVFNY 575 

: !...: I |.|: . |:|.. .|. . . || |. 

331 VYSFSTDLKLVD. . TNKVQDYFKFLQEFPRLTHVSQQAIPVSATNAVENL 378 
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1 


MEMDIDLDDIENL LPNTFNKYSSSCSDKKGCKTLKSGSKSPS , . . 


42 


491 


lELAIKIAWKNLDEIKGHTAIFSDVSGSMSTSMSGGAKKYGSVRTCLEC 


540 


43 


. LTIPKLQKQ LEFYFSDANLYNDSFLRKLVLKSGEQRVEIETLL 
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85 
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telomerase p43 
human La I 
Xenopus LaA I 
Drosophila La I 
S . c . Lhplp 



LQK2LEFYFSDANLYNDSFIRKLVLKSGEQRVEIETLLM 
CHQUE3YYFGDFNLPRDKFLKEQI . KLDEGWVPLEIMIK 
CE3IEYYFGDHNLPRDKFLKQQI . LLDDGWVPLETMIK 
LR3VEYYFGDANLNR3KFIREQIGKNEDGWVPLSVLVT 
CLK2VEFYFSEFNFPY3RFLRTTAEK.NDGWVPISTIAT 



FIG. 18 



1 aactcattta attactaatt taatcaacaa gattgataaa aagcagtaaa taaaacccaa 
61 tagatttaat ttagaaagta tcaattgaaa aatggaaatt gaaaacaact aagcacaata 
121 gccaaaagcc gaaaaattgt ggtgggaact tgaattagag atgcaagaaa accaaaatga 
181 tatataagtt agggttaaga ttgacgatcc taagcaatat ctcgtgaacg tcactgcagc 
241 atgtttgttg taggaaggta gttactacta agataaagat gaaagaagat atatcatcac 
3 01 taaagcactt cttgaggtgg ctgagtctga tcctgagttc atctgctagt tggcagtcta 
361 catccgtaat gaactttaca tcagaactac cactaactac attgtagcat tttgtgttgt 
421 ccacaagaat actcaaccat tcatcgaaaa gtacttcaac aaagcagtac ttttgcctaa 
481 tgacttactg gaagtctgtg aatttgcata ggttctctat atttttgatg caactgaatt 
541 caaaaatttg tatcttgata ggatactttc ataagatatt cgtaaggaac tcactttccg 
601 taagtgttta caaagatgcg tcagaagcaa gttttctgaa ttcaacgaat actaacttgg 
661 taagtattgc actgaatcct aacgtaagaa aacaatgttc cgttacctct cagttaccaa 
721 caagtaaaag tgggattaaa ctaagaagaa gagaaaagag aatctcttaa ccaaacttta 
781 ggcaataaag gaatctgaag ataagtccaa gagagaaact ggagacataa tgaacgttga 
841 agatgcaatc aaggctttaa aaccagcagt tatgaagaaa atagccaaga gatagaatgc 
901 catgaagaaa cacatgaagg cacctaaaat tcctaactct accttggaat caaagtactt 
961 gaccttcaag gatctcatta agttctgcca tatttctgag cctaaagaaa gagtctataa 
1021 gatccttggt aaaaaatacc ctaagaccga agaggaatac aaagcagcct ttggtgattc 
1081 tgcatctgca cccttcaatc ctgaattggc tggaaagcgt atgaagattg aaatctctaa 
1141 aacatgggaa aatgaactca gtgcaaaagg caacactgct gaggtttggg ataatttaat 
1201 ttcaagcaat taactcccat atatggccat gttacgtaac ttgtctaaca tcttaaaagc 
1261 cggtgtttca gatactacac actctattgt gatcaacaag atttgtgagc ccaaggccgt 
1321 tgagaactcc aagatgttcc ctcttcaatt ctttagtgcc attgaagctg ttaatgaagc 
1381 agttactaag ggattcaagg ccaagaagag agaaaatatg aatcttaaag gtcaaatcga 
1441 agcagtaaag gaagttgttg aaaaaaccga tgaagagaag aaagatatgg agttggagta 
1501 aaccgaagaa ggagaatttg ttaaagtcaa cgaaggaatt ggcaagcaat acattaactc 
1561 cattgaactt gcaatcaaga tagcagttaa caagaattta gatgaaatca aaggacacac 
1621 tgcaatcttc tctgatgttt ctggttctat gagtacctca atgtcaggtg gagccaagaa 
1681 gtatggttcc gttcgtactt gtctcgagtg tgcattagtc cttggtttga tggtaaaata 
1741 acgttgtgaa aagtcctcat tctacatctt cagttcacct agttctcaat gcaataagtg 
1801 ttacttagaa gttgatctcc ctggagacga actccgtcct tctatgtaaa aacttttgca 
1861 agagaaagga aaacttggtg gtggtactga tttcccctat. gagtgcattg atgaatggac 
1921 aaagaataaa actcacgtag acaatatcgt tattttgtct gatatgatga ttgcagaagg 
1981 atattcagat atcaatgtta gaggcagttc cattgttaac agcatcaaaa agtacaagga 
2041 tgaagtaaat cctaacatta aaatctttgc agttgactta gaaggttacg gaaagtgcct 
2101 taatctaggt gatgagttca atgaaaacaa ctacatcaag atattcggta tgagcgattc 
2161 aatcttaaag ttcatttcag ccaagcaagg aggagcaaat atggtcgaag ttatcaaaaa 
2221 ctttgccctt caaaaaatag gacaaaagtg agtttcttga gattcttcta taacaaaaat 
2281 ctcaccccac ttttttgttt tattgcatag ccattatgaa atttaaatta ttatctattt 
2341 atttaagtta cttacatagt ttatgtatcg cagtctatta gcctattcaa atgattctgc 
2401 aaagaacaaa aaagattaaa a 



FIG. 19 
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MEIENNQAQQPKAEKLWWELELEMQENQNDIQVRVKIDDPKQYL 
VNVTAACLLQEGSYYQDKDERRYIITKALLEVAESDPEFICQLA 
VYIRNELYIRTTTNYIVAFCWHKNTQPFIEKYFNKAVLLPNDL 
LEVCEFAQVLYIFDATEFKNLYLDRILSQDIRKELTFRKCLQRC 
VRSKFSEFNEYQLGKYCTESQRKKTMFRYLSVTNKQKWDQTKKK 
RKENLLTKLQAIKESEDKSKRETGDIMNVEDAIKALKPAVMKKI 
AKRQNAMKKHMPCAPKIPNSTLESKYLTFKDLIKFCHISEPKERV 
YKILGKKYPKTEEEYKAAFGDSASAPFNPELAGKRMKIEISKTW 
ENELSAKGNTAEVWDNLISSNQLPYMAMLRNLSNILKAGVSDTT 
HSIVINKICEPKAVENSKMFPLQFFSAIEAVNEAVTKGFKAKKR 
ENMNLKGQIEAVKEWEKTDEEKKDMELEQTEEGEFVKVNEGIG 
KQYINSIELAIKIAVNKNLDEIKGHTAIFSDVSGSMSTSMSGGA 
KKYGSVRTCLECALiVLGLMVKQRCEKSSFYIFSSPSSQCNKCYL 
EVDLPGDELRPSMQKLLQEKGKLGGGTDFPYECIDEWTKNKTHV 
DNIVILSDMMIAEGYSDINVRGSSIVNSIKKYKDEVNPNIKIFA 
VDLEGYGKCLNLGDEFNENNYIKIFGMSDSILKFISAKQGGANM 
VEVIKNFALQKIGQK 

FIG. 20 

MSRRNQKKPQAPIGNETNLDFVLQNLEVYKSQIEHYKTQQQQIK 
EEDLKLLKFKNQDQDGNSGNDDDDEENNSNKQQELLRRVNQ I KQ 
QVQLIKKVGSKVEKDLNLNEDENKKNGLSEQQVKEEQLRTITEE 
QVKYQNLVFNMDYQLDLNESGGHRRHRRETDYDTEKWFEISHDQ 
KNYVSIYANQKTSYCWWLKDYFNKNNYDHLNVSINRLETEAEFY 
AFDDFSQT I KLTNNS YQTVNI DVNFDNNLC I LALLRFLL.S LERF 
NILNIRSSYTRNQYINFFEKIGELLETIFAVVFSHRHLQGIHLQVP 
CEAFQ YLVNS S S Q I S VKDSQLQVTS FSTDLKLVDTNKVQD YFKF 
LQE FPRLTHVSQQAI PVSATNAVENLNVLLKKVKHANLNLVS I P 
TQFNFDFYFVNLQHLKLEFGLEPNILTKQKLENLLLSIKQSKNL 
KFLRLNFYTYVAQETSRKQILKQATTIKNLKNNKNQEETPETKD 
ETPSESTSGMKFFDHLSELTELEDFSVNLQATQEIYDSLHKLLI 
RSTNLKKFKLSYKYEMEKSKMDTFIDLKNIYETLNNLKRCSVNI 
SNPHGNISYELTNKDSTFYKFKLTLNQELQHAKYTFKQNEFQFN 
NVKSAKIESSSLESLEDIDSLCKSIASCKNLQNVNIIASLLYPN 
NIQKNPFNKPNLLFFKQFEQLKNLENVSINCILDQHILNSISEF 
LEKNKKI KAF I LKR YYLLQYYLD YTKLFKTLQQLPELNQVY INQ 
QLEELTVSEVHKQVWENHKQKAFYEPLCEFIKESSQTLQLIDFD 
QNTVSDDSIKKILESISESKYHHYLRLNPSQSSSLIKSENEEIQ 
ELLKACDEKGVLVKAYYKFPLCLPTGTYYDYNSDRW 

FIG. 22 



MKILFEFIQDKIiDIDLQTNSTYKENLKCGHFNGLDEILTTCFAL 
PNSRKIALPCLPGDLSHKAVIDHCIIYLLTGELYNNVLTFGYKI 
ARNEDVNNSLFCHSANVNVTLLKGAAWKMFHSLVGTYAFVDLLI 
NYTVIQFNGQFFTQIVGNRCNEPHLPPKWVQRSSSSSATAAQIK 
QLTEPVTNKQFLHKLNINSSSFFPYSKILPSSSSIKKLTDLREA 
IFPTNLVKIPQRLKVRINLTLQKLLKRHKRLNYVSILNSICPPL 
EGTVLDLSHLSRQSPKERVLKFIIVILQKLLPQEMFGSKKNKGK 
IIKNLNLLLSLPLNGYLPFDSLLKKLRLKDFRWLFISDIWFTKH 
NFENLNQLAICFISWLFRQLIPKIIQTFFYCTEISSTVTIVYFR 
HDTWNKLITPFIVEYFKTYLVENNVCRNHNSYTIiSNFNHSKMRI 
IPKKSNNEFRIIAIPCRGADEEEFTIYKENHKNAIQPTQKILEY 
LRNKR PTS FTKI YS PTQ I ADR I KE FKQRLLKKFNNVL PEL YFMK 
FDVKSCYDSIPRMECMRILKDALKNENGFFVRSQYFFNTNTGVL 
KLFNWNASRVPKPYELYIDNVRTVHLSNQDVINWEMEIFKTA 
LWVEDKCYIREDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS 
QDTL I LKLADDFL 1 1 STDQQQVINI KKLAMGGFQKYNAKANRDK 
ILAVSSQSDDDTVIQFCAMHIFVKELEVWKHSSTMNNFHIRSKS 
S KG I FRS L I AL FNTR I S YKT I DTNLNS TNT VLMQ I DH WKN I S E 
CYKSAFKDLSINVTQNMQFHSFLQRIIEMTVSGCPITKCDPLIE 
YEVRFTILNGFLESLSSNTSKFKDNIILLRKEIQHLQAYIYIYI 
HIVN 

FIG. 23 



1 tcaatactat taattaataa ataaaaaaaa gcaaactaca aagaaaatgt caaggcgtaa 
61 ctaaaaaaag ccataggctc ctataggcaa tgaaacaaat cttgattttg tattacaaaa 
121 tctagaagtt tacaaaagcc agattgagca ttataagacc tagtagtaat agatcaaaga 
181 ggaggatctc aagcttttaa agttcaaaaa ttaagattag gatggaaact ctggcaacga 
241 tgatgatgat gaagaaaaca actcaaataa ataataagaa ttattaagga gagtcaatta 
301 gattaagtag caagtttaat tgataaaaaa agttggttct aaggtagaga aagatttgaa 
361 tttgaacgaa gatgaaaaca aaaagaatgg actttctgaa tagcaagtga aagaagagta 
421 attaagaacg attactgaag aataggttaa gtattaaaat ttagtattta acatggacta 
481 ccagttagat ttaaatgaga gtggtggcca tagaagacac agaagagaaa cagattatga 
541 tactgaaaaa tggtttgaaa tatctcatga ccaaaaaaat tatgtatcaa tttacgccaa 
601 ctaaaagaca tcatattgtt ggtggcttaa agattatttt aataaaaaca attatgatca 
661 tcttaatgta agcattaaca gactagaaac tgaagccgaa ttctatgcct ttgatgattt 
721 ttcacaaaca atcaaactta ctaataattc ttactagact gttaacatag acgttaattt 
781 tgataataat ctctgtatac tcgcattgct tagattttta ttatcactag aaagattcaa 
841 tattttgaat ataagatctt cttatacaag aaattaatat aattttgaga aaattggtga 
901 gctacttgaa actatcttcg cagttgtctt ttctcatcgc cacttacaag gcattcattt 
961 acaagttcct tgcgaagcgt tctaatattt agttaactcc tcatcataaa ttagcgttaa 
1021 agatagctaa ttataggtat actctttctc tacagactta aaattagttg acactaacaa 
1081 agtccaagat tattttaagt tcttataaga attccctcgt ttgactcatg taagctagta 
1141 ggctatccca gttagtgcta ctaacgctgt agagaacctc aatgttttac ttaaaaaggt 
1201 caagcatgct aatcttaatt tagtttctat ccctacctaa ttcaattttg atttctactt 
1261 tgttaattta taacatttga aattagagtt tggattagaa ccaaatattt tgacaaaaca 
1321 aaagcttgaa aatctacttt tgagtataaa ataatcaaaa aatcttaaat ttttaagatt 
1381 aaacttttac acctacgttg cttaagaaac ctccagaaaa cagatattaa aacaagctac 
1441 aacaatcaaa aatctcaaaa acaataaaaa tcaagaagaa actcctgaaa ctaaagatga 
1501 aactccaagc gaaagcacaa gtggtatgaa attttttgat catctttctg aattaaccga 
1561 gcttgaagat ttcagcgtta acttgtaagc tacccaagaa atttatgata gcttgcacaa 
1621 acttttgatt agatcaacaa atttaaagaa gttcaaatta agttacaaat atgaaatgga 
1681 aaagagtaaa atggatacat tcatagatct taagaatatt tatgaaacct taaacaatct 
1741 taaaagatgc tctgttaata tatcaaatcc tcatggaaac atttcttatg aactgacaaa 
1801 taaagattct actttttata aatttaagct gaccttaaac taagaattat aacacgctaa 
1861 gtatactttt aagtagaacg aattttaatt taataacgtt aaaagtgcaa aaattgaatc 
1921 ttcctcatta gaaagcttag aagatattga tagtctttgc aaatctattg cttcttgtaa 
1981 aaatttacaa aatgttaata ttatcgccag tttgctctat cccaacaata tttagaaaaa 
2041 tcctttcaat aagcccaatc ttctattttt caagcaattt gaataattga aaaatttgga 
2101 aaatgtatct atcaactgta ttcttgatca gcatatactt aattctattt cagaattctt 
2161 agaaaagaat aaaaaaataa aagcattcat tttgaaaaga tattatttat tacaatatta 
2221 tcttgattat actaaattat ttaaaacact tcaatagtta cctgaattaa attaagttta 
2281 cattaattag caattagaag aattgactgt gagtgaagta cataagtaag tatgggaaaa 
2341 ccacaagcaa aaagctttct atgaaccatt atgtgagttt atcaaagaat catcctaaac 
2401 cctttagcta atagattttg accaaaacac tgtaagtgat gactctatta aaaagatttt 
2461 agaatctata tctgagtcta agtatcatca ttatttgaga ttgaacccta gttaatctag 
2521 cagtttaatt aaatctgaaa acgaagaaat ttaagaactt ctcaaagctt gcgacgaaaa 
2581 aggtgtttta gtaaaagcat actataaatt ccctctatgt ttaccaactg gtacttatta 
2641 cgattacaat tcagatagat ggtgattaat taaatattag tttaaataaa tattaaatat 
2701 tgaatatttc tttgcttatt atttgaataa tacatacaat agtcattttt agtgttttga 
2761 atatatttta gttatttaat tcattatttt aagtaaataa ttatttttca atcatttttt 
2821 aaaaaatcg 
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Oxytricha 
Euplotes 



LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 
LCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT 



FIG, 24 



ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 
GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 
TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 
TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 
ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 
TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 
GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 
TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 
GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 
ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 
TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 
GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 
TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 
GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 
ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 
AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 
AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 
GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 
GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 
ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 
TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 
CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 
AATGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCATGCAG 
AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 
CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 
TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 
TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 
CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 
TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 
TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 
TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 
GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 
TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 
CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 
AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 
TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT 
TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 
CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 
AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 
CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 
TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 
CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 
TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 
ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 
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FIG. 27 



GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 
TTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACC 
GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAG 
AGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGC 
CAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACGGGC 
TGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGA 
GAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCT 
CAACTACGAGCGGGCGCG 



FIG. 28 



MTEHHTPKSRILRFLENQYVYLCTLNDYVQLVLRGSPASSYSNICERLRSDVQTSFSIFLHSTWGF 
DSKPDEGVQFSSPKCSQSELIANWKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLVSTF 
PNYLISILESKNWQLLLEIIGSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKNNVFEETVSKKRKR 
TI ETS I TQNKSARKEVSWNS I S I SRFS I FYRSS YKKFKQDLYFNLHS I CDRNTVHMWLQWI FPRQFG 
LINAFQVKQLHKVIPLVSQSTWPKRLLKVYPLIEQTAKRLHRISLSKVYNHYCPYIDTHDDEKILS 
YSLKPNQVFAFLRSILVRVFPKLIWGNQRIFEIILKDLETFLKLSRYESFSLHYLMSNIKISEIEWL 
VLGKRSNAKMCLSDFEKRKQIFAEFIYWLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIWKLLCR 
PFITSMKMEAFEKINENNVRMDTQKTTLPPAVIRLLPKKNTFRLITNLRKRFLIKMGSNKKMLVSTN 
QTLRPVASILKHLINEESSGIPFNLEVYMKLLTFKKDLLKHRMFGRKKYFVRIDIKSCYDRIKQDLM 
FRIVKKKLKDPEFVIRKYATIHATSDRATKNFVSEAFSYFDMVPFEKVVQLLSMKTSDTLFVDFVDY 
WTKSSSEIFKMLKEHLSGHIVKIGNSQYLQKVGIPQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 
LRWDDFLFITVNKKDAKKFLNLSLRGFEKHNFSTSLEKTVINFENSNGIIlSrNTFFNESKKRMPFFG 
FSVNMRSLDTLLACPKIDEALFNSTSVELTKHMGKSFFYKILRSSLASFAQVFIDITHNSKFNSCCN 
lYRLGYSMCMRAQAYLKRMKDIFIPQRMFITDLLNVIGRKIWKKLAEILGYTSRRFLSSAEVKWLFC 
LGMRDGLKPSFKYHPCFEQLIYQFQSLTDLIKPLRPVLRQVLFLHRRIAD 
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S-l: FFY VTE TTF QKN RLE FYR KSV WSK 

S - 2 : RQH LKR VQL RDV SEA EVR QHR EA 

S-3: ART FRR EKR AER LTS RVK ALF SVL NYE 



A- 1 : AKF LHW LMS VYV VEL LRS FFY VTE TTF Q 
A-2: LFF YRK SVW SKL QSI GIR QHL KRV QLR DVS 
A-3: PAL LTS RLR FIP KPD GLR PIV NMD YW 



FIG, 32 
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Vector 



A5 



B2' 



5825 bp Sequenced 
2 kb Hind III Fragment 



Genomic DNA Insert 



Vector 



1 kb 



FIG. 33A 
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Wild Type Telomerase Gene 
Wild Type Telomerase Gene 



1 , Transform with linear fragment 
containing the telomerase gene 
disrupting with a LEU2 or ura4 marker 



M 



Telomerase Gene::M 



Wild Type Telomerase Gene 




Wild Type Telomerase Gene 



Wild Type (X2) 



2. Assay in selective media 



3. Sporulate, and grow on 
selective media 




Senescence? (X2) 



(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 



FIG. 43 
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wt haploid 



Tetrad 




Generations 
after sporulation 



Telomeric 
Apa I 
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600 bp 
500 bp 
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CD 

UL 




met ser val tyr val val glu leu leu 
GCCAAGTTCCTGCACTGGCTG ATG AGT GTG TAG GTC GTC GAG CTG CTC 



10 20 
arg ser phe phe tyr val thr glu thr thr phe gin lys asn arg 
AGG TCT TTC TTT TAT GTC ACG GAG AGO ACG TTT CAA AAG AAC AGG 



30 

leu phe phe tyr arg lys ser val trp ser lys leu gin ser lie 
CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 



40 50 
gly ile arg gin his leu lys arg val gin leu arg glu leu ser 
GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 



60 

glu ala glu val arg gin his arg glu ala arg pro ala leu leu 
GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 



70 80 
thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 
ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG 

90 

ile val asn met asp tyr val val gly ala arg thr phe arg arg 
ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA 



100 110 

glu lys ala glu arg leu thr ser arg val lys ala leu phe 

GAA AAG ARG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 



ser val leu asn tyr 
AGC GTG CTC AAC TAC 

130 

ala ser val leu gly 
GCC TCT GTG CTG GGC 



120 

glu arg ala arg arg 
GAG CGG GCG CGG CGC 



leu asp asp ile his 
CTG GAC GAT ATC CAC 



pro gly leu leu gly 
CCC GGC CTC CTG GGC 

140 

arg ala trp arg thr 
AGG GCC TGG CGC ACC 



150 

phe val leu arg val arg ala gin asp pro pro pro glu leu tyr 
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC 



160 170 

phe val lys val asp val thr gly ala tyr asp thr ile pro gin 
TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG 



180 

asp arg leu thr glu val ile ala ser ile ile lys pro gin asn 
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 



190 200 

thr tyr cys val arg arg tyr ala val val gin lys ala ala met 
ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC ATG 
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210 

gly thr ser ala arg pro ser arg ala thr ser tyr val gin cys 
GGC ACG TCC GCA AGO CCT TCA AGA GCC ACG TCC TAG GTC GAG TGC 



gin gly lie pro gin gly ser ile leu ser thr leu leu cys ser 
CAG GGG ATC CCG GAG GGC TCC ATC CTC TCC ACG CTG CTC TGC AGC 

240 

leu cys tyr gly asp met glu asn lys leu phe ala gly ile arg 
CTG TGC TAG GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG ATT CGG 

250 - 260 

arg asp gly leu leu leu arg leu val asp asp phe leu leu val 
CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG TTG GTG 

270 

thr pro his leu thr his ala lys thr phe leu arg thr leu val 
ACA CCT CAC CTC ACC CAC GCG AAA ACG TTC CTC AGG ACG CTG GTC 

280 290 

arg gly val pro glu tyr gly cys val val asn leu arg lys thr 
CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG AAG ACA 

300 

val val asn phe pro val glu asp glu ala leu gly gly thr ala 
GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC ACG GGT 

310 320 

phe val gin met pro ala his gly leu phe pro trp cys gly leu 
TTT GTT CAG ATG CCG GCC CAC GGC GTA TTC GCC TGG TGC GGC CTG 

330 

leu leu asp thr arg thr leu glu val gin ser asp tyr ser ser 
CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAG TCC AGC 

340 350 

tyr ala arg thr ser ile arg ala ser leu thr phe asn arg gly 
TAT GCC CGG ACC TCC ATC AGA GCC AGT CTC ACC TTC AAC CGG GGC 

360 

phe lys ala gly arg asn met arg arg lys leu phe gly val leu 
TTC AAG GGT GGG AGG AAC ATG CGT CGG AAA CTC TTT GGG GTC TTG 

370 380 

arg leu lys cys his ser leu phe leu asp leu gin val asn ser 
CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG AAC AGC 

390 

leu gin thr val cys thr asn ile tyr lys ile leu leu leu gin 
CTC CAG ACG GTG TGC ACC AAC ATC TAG AAG ATC CTC CTG CTG CAG 

400 410 

ala tyr arg phe his ala cys val leu gin leu pro phe his gin 
GCG TAG AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT CAT CAG 



220 



230 
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420 

gin val trp lys asn pro his phe ser cys ala ser ser leu thr 
CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TCT CTG ACA 

430 440 

arg leu pro leu leu leu his pro glu ser gin glu arg arg asp 
CGG CTC CCT CTG CTA CTC CAT CCT GAA AGC CAA GAA CGC AGG GAT 

450 

val ala gly gly gin gly arg arg arg pro ser ala leu arg gly 
GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC TCT GCC CTC CGA GGC 

460 470 

arg ala val ala val pro pro ser ile pro ala gin ala asp ser 
CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAC TCG 

480 

thr pro cys his leu arg ala thr pro gly val thr gin asp ser 
ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 

490 500 

pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 
CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC 

510 

pro gly gly arg ser gin pro gly thr ala leu arg leu gin asp 
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 

his pro gly leu met ala thr arg pro gin pro gly arg glu gin 
CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

540 

thr pro ala ala leu ser arg arg ala tyr thr ser gin gly gly 
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 

arg gly gly pro his pro gly leu his arg trp glu ser glu ala 
AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 

CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 

AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 

CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 

TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 

AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 

GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 

TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Motif -1 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCPl 
consensus 



. LWSLIRCFFYVTEQQKSYSKT . . . 
. FIIPILQSFFYITESSDLRNRT. . . 
. LIPKIIQTFFYCTEISSTVTIV. , . 

.YWELLRSFFYVTETTFQKNRL. . . 
FFY TE 



Motif 0 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCPl 
consensus 



K 

p hhh K hR h R 

. KSLGFAPGKLRLIPKKT- -TFRPIMTFNKKIV 
. QKTTLPPAVIRLLPKKN- -TFRLITNLRKRFL 
. TLSNFNHSKMRI I PKKSNNEFRI I AI PCRGAD 
. ARPALLTSRLRFIPKPD- -GLRPIVNMDYWG 
R PK R I 



Motif A 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCPl 
consensus 



AF 

h hDh GY h 
. PKLFFATMDIEKCYDSVNREKLSTFLK . 
. RKKY F VR I D I KS C YDR I KQDLMFR I VK . 
. PELYFMKFDVKSCYDSIPRMECMRILK. 
. PELYFVKVDVTGAYDTIPQDRLTEVIA . 

F D YD 



// 



Motif B 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCPl 
consensus 



hPQG pS hh 
. NGKFYKQTKG I PQGLCVS S I LS S F YYA . 
.GNSQYLQKVGIPQGSILSSFLCHFYME . 
. EDKCYIREDGLFQGSSLSAPIVDLVYD . 
.RATSYVQCQGIPQGSILSTLLCSLCYG. 
G QG S 



Motif C 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCPl 
consensus 



h F DD hhh 
. PNVNLLMRLTDDYLLITTQENN . . 
.KKGSVLLRWDDFLFITVNKKD. . 
. SQDTLILKLADDFLIISTDQQQ. . 
. RRDGLLLRLVDDFLLVTPHLTH . . 
DD L 



Motif D 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCPl 
consensus 



Gh h cK 
. NVSRENGFKFNMKKL . 
. LNLSLRGFEKHNFST . 
. KKLAMGGFQKYNAKA . 
. LRTLVRGVPEYGCW . 

G 
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1 GCAGCGCTGC GTCCTGCTGC 

51 CCGCGATGCC GCGCGCTCCC 

101 AGCCACTACC GCGAGGTGCT 

151 GCCCCAGGGC TGGCGGCTGG 

2 01 CGNTGGTGGC CCANTGCNTG 
251 CCCGCCGCCC CCTCCTTCCG 

3 01 CCGAGTGCTG CANANGCTGT 
351 TCGGCTTCGC GCTGCTGGAC 

4 01 ACCACCAGCG TGCGCAGCTA 
451 GGGGAGCGGG GCGTGGGGGC 
501 TGGTTCACCT GCTGGCACGC 
551 TGCGCCTACC ANGTGTGCGG 
601 TCAGGCCCGG CCCCCGCCAC 
651 CAACGGGCCT GGAACCATAG 
701 CCAGCCCCGG GTGCGAGGAG 
751 GTTGCCCAAG AGGCCCAGGC 
8 01 CCGTTGGGCA GGGGTCCTGG 
851 GACCGTGGTT TCTGTGTGGT 
901 CTCTTTGGAG GGTGCGCTCT 
951 GCCGCCAGCA CCACGCGGGC 

10 01 GGGACACGCC TTGTCCCCCG 

10 51 TCCTCAGGCG ACAAGNACAC 

1101 AGGCCCAGGC TGACTGGCGT 

1151 TTCCAGGCCT TGGATGCCAG 

12 01 GCGNTACTGG CAAATGCGGC 

12 51 CGCAGTGCCC CTACGGGGTG 

13 01 GCGGTCACCC CAGCAGCCGG 

13 51 TGTGGCGGCC CCCGAGGAGG 

14 01 CTGCTCCGCC AGCACAGCAG 

14 51 CTGCCTGCGC CGGCTGGTGC 

15 01 AACGCCGCTT CCTCAGGAAC 

15 51 GCCAAGCTCT CGCTGCAGGA 

16 01 CGCTTGGCTG CGCAGGAGCC 

16 51 ACCGTCTGCG TGAGGAGATC 

17 01 GTGTACGTCG TCGAGCTGCT 

17 51 GTTTCAAAAG AACAGGCTCT 

18 01 TGCAAAGCAT TGGAATCAGA 

18 51 CTGTCGGAAG CAGAGGTCAG 

19 01 GACGTCCAGA CTCCGCTTCA 
19 51 TGAACATGGA CTACGTCGTG 
2 0 01 GCCGAGCGTC TCACCTCGAG 
2 051 CGAGCGGGCG CGGCGCCCCG 
2101 ACGATATCCA CAGGGCCTGG 
2151 GACCCGCCGC CTGAGCTGTA 
2 2 01 CGACACCATC CCCCAGGACA 

22 51 AACCCCAGAA CACGTACTGC 
2 3 01 GCCCATGGGC ACGTCCGCAA 

23 51 AGACCTCCAG CCGTACATGC 

24 01 GCCCGCTGAG GGATGCCGTC 
24 51 GCCAGCAGTG GCCTCTTCGA 



GCACGTGGGA AGCCCTGGCC CCGGCCACCC 
CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC 
GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG 
TGCAGCGCGG GGACCCGGCG GCTTTCCGCG 
GTGTGCGTGC CCTGGGANGN ANGGCNGCCC 
CCAGGTGTCC TGCCTGAANG ANCTGGTGGC 
GCGANCGCGG CGCGAANAAC GTGCTGGCCT 
GGGGCCCGCG GGGGCCCCCC CGAGGCCTTC 
CCTGCCCAAC ACGGTGACCG ACGCACTGCG 
TGCTGCTGCG CCGCGTGGGC GACGACGTGC 
TGCGCGNTNT TTGTGCTGGT GGNTCCCAGC 
GCCGCCGCTG TACCAGCTCG GCGCTGCNAC 
ACGCTANTGG ACCCGAANGC GTCTGGGATC 
CGTCAGGGAG GCCGGGGTCC CCCTGGGCTG 
GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC 
GTGGCGCTGC CCCTGAGCCG GAGCGGACGC 
GCCCACCCGG GCAGGACGCC TGGACCGAGT 
GTCACCTGCC AGACCCGCCG AAGAAGCCAC 
CTGGCACGCG CCACTCCCAC CCATCCGTGG 
CCCCCATCCA CATCGCGGCC ACCACGTCCT 
GTGTACGCCG AGACCAAGCA CTTCCTCTAC 
TGCGNCCCTC CTTCCTACTC AATATATCTG 
TCGGGAGGTT CGTGGAGACA NTCTTTCTGG 
GATTCCCCGC AGGTTGCCCC GCCTGCCCCA 
CCCTGTTTCT GGAGCTGCTT GGGAACCACG 
TTCCTCAAGA CGCACTGCCC GCTGCGAGCT 
TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 
AGGAACACAG ACCCCCGTCG CCTGGTGCAG 
CCCCTGGCAG GTGTACGGCT TCGTGCGGGC 
CCCCAGGCCT CTGGGGCTCC AGGCACAACG 
ACCAAGAAGT TCATCTCCCT GGGGAAGCAT 
GCTGACGTGG AAGATGAGCG TGCGGGACTG 
CAGGGGTTGG CTGTGTTCCG GCCGCAGAGC 
CTGGCCAAGT TCCTGCACTG GCTGATGAGT 
CAGGTCTTTC TTTTATGTCA CGGAGACCAC 
TTTTCTACCG GAAGAGTGTC TGGAGCAAGT 
CAGCACTTGA AGAGGGTGCA GCTGCGGGAG 
GCAGCATCGG GAAGCCAGGC CCGCCCTGCT 
TCCCCAAGCC TGACGGGCTG CGGCCGATTG 
GGAGCCAGAA CGTTCCGCAG AGAAAAGAGG 
GGTGAAGGCA CTGTTCAGCG TGCTCAACTA 
GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG 
CGCACCTTCG TGCTGCGTGT GCGGGCCCAG 
CTTTGTCAAG GTGGATGTGA CGGGCGCGTA 
GGCTCACGGA GGTCATCGCC AGCATCATCA 
GTGCGTCGGT ATGCCGTGGT CCAGAAGGCC 
GGCCTTCAAG AGCCACGTCT CTACCTTGAC 
GACAGTTCGT GGCTCACCTG CAGGANAACA 
GTCATCGAGC AGAGCTCCTC CCTGAATGAG 
CGTCTTCCTA CGCTTCATGT GCCACCACGC 
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2 5 01 CGTGCGCATC AGGGGCAAGT 

2 551 GCTCCATCCT CTCCACGCTG 

26 01 AACAAGCTGT TTGCGGGGAT 

2 6 51 GGATGATTTC TTGTTGGTGA 

2 7 01 TCAGGACCCT GGTCCGAGGT 

2 751 CGGAAGACAG TGGTGAACTT 

2 8 01 GGCTTTTGTT CAGATGCCGG 

2 8 51 TGCTGGATAC CCGGACCCTG 

2 901 CGGACCTCCA TCAGAGCCAG 

2 951 GAGGAACATG CGTCGCAAAC 

3 001 GCCTGTTTCT GGATTTGCAG 
3 051 ATCTACAAGA TCCTCCTGCT 
3101 GCAGCTCCCA TTTCATCAGC 
3151 GCGTCATCTC TGACACGGCC 
3 2 01 AACGCAGGGA TGTCGCTGGG 
3 251 CGAGGCCGTG CAGTGGCTGT 
3 3 01 GACACCGTGT CACCTACGTG 
3 3 51 ACGCAGCTGA GTCGGAAGCT 
3 4 01 CGCAGCCAAC CCGGCACTGC 
34 51 GGCCACCCGC CCACAGCCAG 
3 5 01 GCCGGGCTCT ACGTCCCAGG 
3 551 ACCGCTGGGA GTCTGAGGCC 
3 6 01 CCGGCTGAAG GCTGAGTGTC 
3 6 51 GGGCTGAGTG TCCAGCACAC 
3 701 CTCGGCTCCA CCCCAGGGCC 
3 751 CTCCCCACAT AGGAATAGTC 
3 8 01 GCCCTGCCCT CCTTTGCCTT 
3 851 GAAGGACCCT GGGAGCTCTG 
3 9 01 TACACAGGCG AGGACCCTGC 

3 951 GGGGGGAGGT GCTGTGGGAG 

4 001 TTGAAAAAAA AAAAAAAAAA 



CCTACGTCCA GTGCCAGGGG ATCCCGCAGG 
CTCTGCAGCC TGTGCTACGG CGACATGGAG 
TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT 
CACCTCACCT CACCCACGCG AAAACCTTCC 
GTCCCTGAGT ATGGCTGCGT GGTGAACTTG 
CCCTGTAGAA GACGAGGCCC TGGGTGGCAC 
CCCACGGCCT ATTCCCCTGG TGCGGCCTGC 
GAGGTGCAGA GCGACTACTC CAGCTATGCC 
TCTCACCTTC AACCGCGGCT TCAAGGCTGG 
TCTTTGGGGT CTTGCGGCTG AAGTGTCACA 
GTGAACAGCC TCCAGACGGT GTGCACCAAC 
GCAGGCGTAC AGGTTTCACG CATGTGTGCT 
AAGTTTGGAA GAACCCCACA TTTTTCCTGC 
TCCCTCTGCT ACTCCATCCT GAAAGCCAAG 
GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC 
GCCACCAAGC ATTCCTGCTC AAGCTGACTC 
CCACTCCTGG GGTCACTCAG GACAGCCCAG 
CCCGGGGACG ACGCTGACTG CCCTGGAGGC 
CCTCAGACTT CAAGACCATC CTGGACTGAT 
GCCGAGAGCA GACACCAGCA GCCCTGTCAC 
GAGGGAGGGG CGGCCCACAC CCAGGCCCGC 
TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 
CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA 
CTGCCGTCTT CACTTCCCCA CAGGCTGGCG 
AGCTTTTCCT CACCAGGAGC CCGGCTTCCA 
CATCCCCAGA TTCGCCATTG TTCACCCCTC 
CCACCCCCAC CATCCAGGTG GAGACCCTGA 
GGAATTTGGA GTGACCAAAG GTGTGCCCTG 
ACCTGGATGG GGGTCCCTGT GGGTCAAATT 
TAAAATACTG AATATATGAG TTTTTCAGTT 
AAAAAAAAA 
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GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCGATGCC 

]_ ^. + + ^ + ^ gQ 

CGTCGCGACGCAGGACGACGCGTGCACCCTTCGGGACCGGGGCCGGTGGGGGCGCTACGG 

a AALRPAAHVGSPGPGHPRDA 

b QRCVLLRTWEALAPATPAMP- 

c SAASCCARGKPWPRPPPRCR- 

GCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCT 

61 + + + + + + 120 

CGCGCGAGGGGCGACGGCTCGGCACGCGAGGGACGACGCGTCGGTGATGGCGCTCCACGA 

a ARSPLPSRALPAAQPLPRGA 

b RAPRCRAVRSLLRSHYREVL- 

C ALPAAEPCAPCCAATTARCC- 

GCCGCTGGCCACGTTCGTGCGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGG 
121 + + + + + + 180 

CGGCGACCGGTGCAAGCACGCCGCGGACCCCGGGGTCCCGACCGCCGACCACGTCGCGCC 

a AAGHVRAAPGAPGLAAGAAR 

b PLATFVRRLGPQGWRLVQRG- 

C RWPRSCGAWGPRAGGWCSAG- 

GGACCCGGCGGCTTTCCGCGCGNTGGTGGCCCTysrTGCNTGGTGTGCGTG 
181 + + + + + + 240 

CCTGGGCCGCCGAAAGGCGCGCNACCACCGGGTNACGNACCACACGCACGGGACCCTNCN 

a GPGGFPR?GGP??GVRALG? 

b DPAAFRA?VA?C?VCVPW??- 

c TRRLSARWWP?AWCACPG??- 

ANGGCNGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGTCCTGCCTGAANGANCTGGTGGC 

241 + + + + + + 300 

TNCCGNCGGGGGGCGGCGGGGGAGGAAGGCGGTCCACAGGACGGACTTNCTNGACCACCG 

a ?AAPRRPLLPPGVLPE??GG 

b ??PPAAPSFRQVSCL??LVA- 

c G?PPPPPPSARCPA*??WWP- 

CCGAGTGCTGCANANGCTGTGCGANCGCGGCGCGAANAACGTGCTGGCCTTCGGCTTCGC 
301 + + + + + + 360 

GGCTCACGACGTNTNCGACACGCTNGCGCCGCGCTTNTTGCACGACCGGAAGCCGAAGCG 

a PSAA?AVR?RRE?RAGLRIjR 

b RVL??LC?RGA?NVLAFGFA- 

C ECC??CA?AAR?TCWPSASR- 

GCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTA 
361 + + + + + + 420 

CGACGACCTGCCCCGGGCGCCCCCGGGGGGGCTCCGGAAGTGGTGGTCGCACGCGTCGAT 

a AAGRGPRGPPRGLHHQRAQL 

b LLD.GARGGPPEAFTTSVRSY- 

C CWTGPAGAPPRPSPPACAAT- 

CCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCG 

421 + + + + + + 480 

GGACGGGTTGTGCCACTGGCTGCGTGACGCCCCCTCGCCCCGCACCCCCGACGACGACGC 

a PAQHGDRRTAGERGVGAAAA 

b LPNTVTDALRGSGAWGLLLR- 

c CPTR*PTHCGGAGRGGCCCA- 
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a PRGRRRAGSPAGTLR??CAG 

b RVGDDVLVHLLARCA?FVLV- 

C AWATTCWFTCWHAAR?LCWW- 

GGNTCCCAGCTGCGCCTACCANGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCNAC 

541 + + + + + + 600 

CCNAGGGTCGACGCGGATGGTNCACACGCCCGGCGGCGACATGGTCGAGCCGCGACGNTG 

a GSQLRLP7VRAAAVPARRC? 

b ?PSCAY?VCGPPLYQLGAAT- 

C ?PAAPT?CAGRRCTSSAL?L- 

TCAGGCCCGGCCCCCGCCACACGCTANTGGACCCGAANGCGTCTGGGATCCAACGGGCCT 

601 + + + + + + 660 

AGTCCGGGCCGGGGGCGGTGTGCGATNACCTGGGCTTNCGCAGACCCTAGGTTGCCCGGA 

a SGPAPATR?WTR?RLGSNGP 

b QARPPPHA?GPE?VWDPTGL- 

c RPGPRHTL?DP?ASGIQRAW- 

GGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCTGCCAGCCCCGGGTGCGAGGAG 

661 + + + + + + 720 

CCTTGGTATCGCAGTCCCTCCGGCCCCAGGGGGACCCGACGGTCGGGGCCCACGCTCCTC 

a GTIASGRPGSPWAASPGCEE 

b EP*RQGGRGPPGLPAPGARR- 

C NHSVREAGVPLGCQPRVRGG- 

GCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGC 
721 -}. + + + + + 780 

CGCGCCCCCGTCACGGTCGGCTTCAGACGGCAACGGGTTCTCCGGGTCCGCACCGCGACG 

a ARGQCQPKSAVAQEAQAWRC 

b RGGSASRSLPLPKRPRRGAA- 

C AGAVPAEVCRCPRGPGVALP- 

CCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCC 

781 + + - + + + + 840 

GGGACTCGGCCTCGCCTGCGGGCAACCCGTCCCCAGGACCCGGGTGGGCCCGTCCTGCGG 

a P*AGADARWAGVLGPPGQDA 

b PEPERTPVGQGSWAHPGRTP- 

C LSRSGRPLGRGPGP TRAGRL- 

TGGACCGAGTGACCGTGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCAC 

841 + + + + + + 900 

ACCTGGCTCACTGGCACCAAAGACACACCACAGTGGACGGTCTGGGCGGCTTCTTCGGTG 

a WTE*PWFLCGVTCQTRRRSH 

b GPSDRGFCVVSPARPAEEAT- 

C DRVTVVSVWCHLPDPPKKPP- 

CTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCA 
901 + + + + + + 960 

GAGAAACCTCCCACGCGAGAGACCGTGCGCGGTGAGGGTGGGTAGGCACCCGGCGGTCGT 

a LFGGCALWHAPLPPIRGPPA 

b SLEGALSGTRHSHPSVGRQH- 

C LWRVRSLARATPTHPWAAST- 

CCACGCGGGCCCCCCATCCACATCGCGGCCACCACGTCCTGGGACACGCCTTGTCCGCCG 
961 + + + + + + 1020 

GGTGCGCCCGGGGGGTAGGTGTAGCGCCGGTGGTGCAGGACCCTGTGCGGAACAGGGGGC 
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PRGPPIHIAATTSWDTPCPP 
HAGPPSTSRPPRPGTRLVPR- 
TRAPHPHRGHHVLGHALS PG- 

GTGTACGCCGAGACCAAGC7VCTTCCTCTACTCCTCAGGCGACAAGNACACTGCGNCCCTC 

1021 + + + + + + 1080 

CACATGCGGCTCTGGTTCGTGAAGGAGATGAGGAGTCCGCTGTTCmX3TGACGCNGGG 

VYAETKHFLYSSGDK?TA?L 
CTPRPSTSSTPQAT7TLRPS - 
VRRDQALPLLLRRQ7HC? PP- 

CTTCCTACTCAATATATCTQAGGCCCAGCCTGACTGGCGTTCGGGAGGTTCGTGGAGACA 

1081 + + + + + + 1140 

GAAGGATGAGTTATATAGACTCCGGGTCGGACTGACCGCAAGCCCTCCAAGCACCTCTGT 

LPTQYI*GPA*LAFGRFVET 
FLLNISEAQPDWRSGGSWR? - 
SYSIYLRPSLTGVREVRGD?- 

NTCTTTCTGGTTCCAGGCCTTGGATGCCAGGATTCCCCGCAGGTTGCCCCGCCTGCCCCA 

1141 + + + + + + 1200 

NAGAAAGACCAAGGTCCGGAACCTACGGTCCTAAGGGGCGTCCAACGGGGCGGACGGGGT 

?FLVPGLGCQDSPQVAPPAP 
SFWFQALDARIPRRLPRLPQ - 
LSGSRPWMPGFPAGCPACPS- 

GCGNTACTGGCAAATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCC 

1201 + + + + + + 1260 

CGO^ATGACCGTTTACGCCGGGGACAAAGACCTCGACGAACCCTTGGTGCGCGTCACGGG 

A7LANAAPVSGAAWEPRAVP 
RYWQMRPLFLELLGNHAQCP - 
7TGKCGPCFWSCI.GTTRSAP- 

CTACGGGGTGTTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGG 

1261 + + + + + + 1320 

GATGCCCCACAAGGAGTTCTGCGTGACGGGCGACGCTCGACGCCAGTGGGGTCGTCGGCC 

LRGVPQDALPAASCGHPSSR 
YGVFLKTHCPLRAAVTPAAG- 
TGCS SRRTARCELRS PQQPV- 

TGTCTGTGCCCGGGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGAACACAG 

1321 + + + + + + 1380 

ACAGACACGGGCCCTCTTCGGGGTCCCGAGACACCGCCGGGGGCTCCTCCTCCTTGTGTC 

CLCPGEAPGLCGGPRGGGTQ 
VCAREKPQGSVAAPEEEEHR- 
SVPGRS PRALWRPPRRRNTD- 

ACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCT 
1381 + + + + + + 1440 

TGGGGGCAGCGGACCACGTCGACGAGGCGGTCGTGTCGTCGGGGACCGTCCACATGCCGA 

TPVAWCSCSASTAAPGRCTA 
PPSPGAAAPPAQQPLAGVRL - 
PRRLVQLLRQHSSPWQVYGF- 

TCGTGCGGGCCTGCCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACG 
1441 + + + + ^ ^ ]_5QQ 

AGCACGCCCGGACGGACGCGGCCGACCACGGGGGTCCGGAGACCCCGAGGTCCGTGTTGC 
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a SCGPACAGWCPQASGAPGTT 

b RAGLPAPAGAPRPLGLQAQR- 

c VRACLRRLVPPGIiWGSRHNE- 

AACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCT 

1501 + + + + + + 1560 

TTGCGGCGAAGGAGTCCTTGTGGTTCTTCAAGTAGAGGGACCCCTTCGTACGGTTCGAGA 

a NAASSGTPRSSSPWGSMPSS 

b TPLPQEHQEVHLPGEACQAL- 

C RRFLRN.TKKFISLGKHAKLS- 

CGCTGCAGGAGCTGACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCC 

1561 + + + + + + 1620 

GCGACGTCCTCGACTGCACCTTCTACTCGCACGCCCTGACGCGAACCGACGCGTCCTCGG 

a RCRS*RGR*ACGTALGCAGA 

b AAGADVEDERAGLRLAAQEP- 

C LQELTWKMSVRDCAWLRRSP- 

CAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGT 

1621 4- + + + + + 1680 

GTCCCCAACCGACACAAGGCCGGCGTCTCGTGGCAGACGCACTCCTCTAGGACCGGTTCA 

a QGLAVFRPQSTVCVRRSWPS 

b RGWLCSGRRAPSA*GDPGQV- 

C GVGCVPAAEHRIiREEILAKF- 

TCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATO 

1681 + + + + + + 1740 

AGGACGTGACCGACTACTCACACATGCAGCAGCTCGACGAGTCCAGAAAGAAAATACAGT 

a SCTG**VCTSSSCSGLSFMS 

b PALADECVRRRAAQVFLLCH- 

C LHWLMSVYVVELLRSFFYVT- 

CGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGC^ 

1741 + + + + + + 1800 

GCCTCTGGTGCAAAGTTTTCTTGTCCGAGAAAAAGATGGCCTTCTCACAGACCTCGTTCA 

a RRPRFKRTGSFS TGRVSGAS 

b GDHVSKEQALFLPEECLEQV- 

c ETTFQKNRLFFYRKSVWSKL- 

TGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAA 

1801 + + + + + + 1860 

ACGTTTCGTAACCTTAGTCTGTCGTGAACTTCTCCCACGTCGACGCCCTCGACAGCCTTC 

a CKALESDST*RGCSCGSCRK 

b AKHWNQTALEEGAAAGAVGS- 

c QSIGIRQHLKRVQLREL.SEA- 

CAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCA 

1861 + + + + + + 1920 

GTCTCCAGTCCGTCGTAGCCCTTCGGTCCGGGCGGGACGACTGCAGGTCTGAGGCGAAGT 

a QRSGSIGKPGPPC*RPDSAS 

b RGQAASGSQARPADVQTPLH- 

C EVRQHREARPALLTSRLRFI- 

TCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGA^ 
1921 + + + + + + 1980 



AGGGGTTCGGACTGCCCGACGCCGGCTAACACTTGTACCTGATGCAGCACCCTCGGTCTT 
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a SPSLTGCGRL*TWTTSWEPE 

b PQA*RAAADCEHGLRRGSQN- 

C PKPDGLRPIVNMDYVVGART- 

CGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCG 

1981 + + + + + + 2040 

GCAAGGCGTCTCTTTTCTCCCGGCTCGCAGAGTGGAGCTCCCACTTCCGTGAC^ 

a RSAEKRGPSVSPRG*RHCSA 

b VPQRKEGRASHLEGEGTVQR- 

C FRREKRAERLTSRVKALFSV- 

TGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGG 

2041 + + + + + + 2100 

ACGAGTTGATGCTCGCCCGCGCCGCGGGGCCGGAGGACCCGCGGAGACACGACCCGGACC 

a CSTTSGRGAPASWAPLCWAW 

b AQLRAGAAPRPPGRLCAGPG- 

C LNYERARRPGLLGASVLGLD- 

ACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGC 

2101 + + + + + + 2160 

TGCTATAGGTGTCCCGGACCGCGTGGAAGCACGACGCACACGCCCGGGTCCTGGGCGGCG 

a TISTGPGAPSCCVCGPRTRR 

b RYPQGLAHLRAACAGPGPAA- 

c DIHRAWRTFVLRVRAQDPPP- 

CTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCCCCAGGACA 

2161 + -I- + + + + 2220 

GACTCGACATGAAACAGTTCCACCTACACTGCCCGCGCATGCTGTGGTAGGGGGTCCTGT 

a LSCTLSRWM*RARTTPSPRT 

b *AVLCQGGCDGRVRHHPPGQ- 

C ELYFVKVDVTGAYDTIPQDR- 

GGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGT 

2221 + + + + + + 2280 

CCGAGTGCCTCCAGTAGCGGTCGTAGTAGTTTGGGGTCTTGTGCATGACGCACGCAG 

a GSRRSSPASSNPRTRTACVG 

b AHGGHRQHHQTPEHVLRASV- 

c LTEVIASIIKPQNTYCVRRY- 

ATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCT 

2281 + + + + + + 2340 

TACGGCACCAGGTCTTCCGGCGGGTACCCGTGCAGGCGTTCCGGAAGTTCTCGGTGCAGA 

a MPWSRRPPMGTSARPSRATS 

b CRGPEGRPWARPQGLQEPRL- 

C AVVQKAAHGHVRKAFKSHVS- 

CTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGANAACA 

2341 + + + + + + 2400 

GATGGAACTGTCTGGAGGTCGGCATGTACGCTGTCAAGCACCGAGTGGACGTCCTlSm 

a LP*QTSSRTCDSSWLTCR?T 

b YLDRPPAVHATVRGSPAG7Q- 

C TLTDLQPYMRQFVAHLQ7NS- 

GCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTG 

2401 + + + + -f + 2460 

CGGGCGACTCCCTACGGCAGCAGTAGCTCGTCTCGAGGAGGGACTTACTCCGGTCGTCAC 
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a AR*GiyiPSSSSRAPP*MRPAV 

b PAEGCRRHRAELLPE*GQQ W- 

C PLRDAVVIEQSSSLNEASSG- 

GCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGT 

2461 + + + + + + 2520 

CGGAGAAGCTGCAGAAGGATGCGAAGTACACGGTGGTGCGGCACGCGTAGTCCCCGTTCA 

a ASSTSSYASCATTPCASGAS 

b PLRRLPTLHVPPRRAHQGQV- 

c LFDVFLRFMCHHAVRIRGKS- 

CCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCC 

2521 + + + + + + 2580 

GGATGCAGGTCACGGTCCCCTAGGGCGTCCCGAGGTAGGAGAGGTGCGACGAGACGTCGG 

a PTSSARGSRRAPSSPRCSAA 

b LRPVPGDPAGLHPLHAALQP- 

C YVQCQGIPQGSILSTLLCSL- 

TGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACC^ 

2581 + + + + + + 2640 

ACACGATGCCGCTGTACCTCTTGTTCGACAAACGCCCCTAAGCCGCCCTGCCCGACGAGG 

a CATATWRTSCLRGFGGTGCS 

b VLRRHGEQAVCGDSAGRAAP- 

c CYGDMENKLFAGIRRDGLLL- 

TGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACCCACGCG^^ 

2641 + + + + + + 2700 

ACGCAAACCACCTACTAAAGAACAACCACTGTGGAGTGGAGTGGGTGCGCTTTTGG^ 

a CVWWMISCW*HLTSPTRKPS 

b AFGG*FLVGDTSPHPRENLP- 

C RLVDDFLLVTPHLTHAKTFL- 

TCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAG 

2701 + + + + + + 2760 

AGTCCTGGGACCAGGCTCCT^CAGGGACTCATACCGACGCACCACTTGAACGCCTTCTGTC 

a SGPWSEVSLSMAAW*TCGRQ 

b QDPGPRCP*VWLRGELAEDS- 

c RTLVRGVPEYGCVVN'LRKTV- 

TGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTT^ 

2761 + + + + + + 2820 

ACCACTTGAAGGGACATCTTCTGCTCCGGGACCCACCGTGCCGAAAACAAGTCTACGGCC 

a W*TSL*KTRPWVARLLFRCR 

b GELPCRRRGPGWHGFCSDAG- 

c VNFPVEDEALGGTAFVQMPA- 

CCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGA 

2821 + + + + + + 2880 

GGGTGCCGGATAAGGGGACCACGCCGGACGACGACCTATGGGCCTGGGACCTCCACGTCT 

a PTAYSPGAACCWIPGPWRCR 

b PRPIPLVRPAAGYPDPGGAE- 

C HGLFPWCGLLLDTRTLEVQS- 

GCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCC7VGTCTC71CCTTCAACCGCGGCT 

2881 + + + + + + 2940 

CGCTGATGAGGTCGATACGGGCCTGGAGGTAGTCTCGGTCAGAGTGGAAGTTGGCGCCGA 
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a ATTPAMPGPPSEPVSPSTAA 

b RLLQLCPDLHQSQSHLQPRL- 

C DYSSYARTSIRASLTFNRGF- 

TCT^GGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGTGTCAC^ 

2941 + + + + + + 3000 

AGTTCCGACCCTCCTTGTACGCAGCGTrrGAGAAACCCCAGAACGCCGACTTCACAGTGT 

a SRLGGTCVANSLGSCG*SVT 

b QGWEEHASQTLWGLAAEVSQ- 

C KAGRNMRRKLFGVLRLKCHS- 

GCCTGTTTCTGGATTTGCT^GGTGAACAGCCTCCAGACGGTGTGCACCAACATCT 

3001 + + + + + + 3060 

CGGACAAAGACCTAAACGTCCACTTGTCGGAGGTCTGCCACACGTGGTTGTAGATGTTCT 

a ACFWICR*TASRRCAPTSTR 

b PVSGFAGEQPPDGVHQHLQD- 

c LFLDLQVNSLQTVCTNIYKI- 

TCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTCATC^ 

3061 + + + + + + 3120 

AGGAGGACGACGTCCGCATGTCCAAAGTGCGTACACACGACGTCGAGGGTAAAGTAGTCG 

a SSCCRRTGFTHVCCSSHFIS 

b PPAAGVQVSRMCAAAPISSA- 

C LLLQAYRFHACVLQLPFHQQ- 

AAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCCTCTGCT 

3121 + + + + + + 3180 

TTCAAACCTTCTTGGGGTGTAAAAAGGACGCGCAGTAGAGACTGTGCCGGAGGGAGACGA 

a KFGRTPHFSCASSLTRPPSA 

b SLEEPHIFPARHL*HGLPLL- 

C VWKNPTFFLRVISDTASLCY- 

ACTCCT^TCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCGCCGGCC 

3181 + + + + 4- + 3240 

TGAGGTAGGACTTTCGGTTCTTGCGTCCCTACAGCGACCCCCGGTTCCCGCGGCGGCCGG 

a TPS*KPRTQGCRWGPRAPPA 

b LHPESQERRDVAGGQGRRRP- 

C SILKAKNAGMSLGAKGAAGP- 

CTCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGCTGACTC 

3241 + + + + + + 3300 

GAGACGGGAGGCTCCGGCACGTCACCGACACGGTGGTTCGTAAGGACGAGTTCGACTGAG 

a LCPPRPCSGCATKHSCSS*L 

b SALRGRAVAVPPSIPAQADS- 

c LPSEAVQWLCHQAFLLKLTR- 

GACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGCAGCTGA 

3301 + + + + + + 3360 

CTGTGGCACAGTGGATGCACGGTGAGGACCCCAGTGAGTCCTGTCGGGTCTGCGTCGACT 

a DTVSPTCHSWGHSGQPRRS* 

b TPCHLRATPGVTQDSPDAAE- 

c HRVTYVPLtiGSLRTAQTQLS- 

GTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGGCACTGC 

3361 + + + + + + 3420 

CAGCCTTCGAGGGCCCCTGCTGCGACTGACGGGACCTCCGGCGTCGGTTGGGCCGTGACG 
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a VGSSRGRR*LPWRPQPTRHC 

b SEAPGDDADCPGGRSQPGTA- 

C RKLPGTTLTALEAAANPALP- 

CCTCAGACTTCAAGACCATCCTGGACTGATGGCCACCCGCCCACAGCCAGGCCGAGAGC^ 

3421 + + + + + + 3480 

GGAGTCTGAAGTTCTGGTAGGACCTGACTACCGGTGGGCGGGTGTCGGTCCGGCTCTCGT 

a PQTSRPSWTDGHPPTARPRA 

b LRLQDHPGLMATRPQPGREQ- 

C SDFKTILD*WPPAHSQAESR- 

GACACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACAC 
3481 + + + + + + 3540 

CTGTGGTCGTCGGGACAGTGCGGCCCGAGATGCAGGGTCCCTCCCTCCCCGCCGGGTGTG 

a DTSSPVTPGSTSQGGRGGPH 

b TPAALSRRALRPREGGAAHT- 

C HQQPCHAGLYVPGREGRPTP- 

CCAGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGC^ 
3541 + + + + ^ _^. 3500 

GGTCCGGGCGTGGCGACCCTCAGACTCCGGACTCACTCACAAACCGGCTCCGGACGTACA 

a PGPHRWESEA*VSVWPRPAC 

b QARTAGSLRPE*VFGRGLHV- 

c RPAPLGV*GLSECLAEACMS- 

CCGGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTG 

3601 + + + + + + 3660 

GGCCGACTTCCGACTCACAGGCCGACTCCGGACTCGCTCACAGGTCGGTTCCCGACTCAC 

a PAEG*VSG*GLSECPAKG*V 

b RLKAECPAEA*ASVQPRAEC- 

C G*RLSVRLRPERVSSQGLSV- 

TCCAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCC 

3661 + + + + + + 3720 

AGGTCGTGTGGACGGCAGAAGTGAAGGGGTGTCCGACCGCGAGCCGAGGTGGGGTCCCGG 

a SSTPAVFTSPQAGARLHPRA 

b PAHLPSSLPHRLALGSTPGP- 

C QHTCRLHFPTGWRSAPPQGQ- 

AGCTTTTCCTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGA 
3721 + + + + + + 3780 

TCGAAAAGGAGTGGTCCTCGGGCCGAAGGTGAGGGGTGTATCCTTATCAGGTAGGGGTCT 

a SFSSPGARLPLPT*E*SIPR 

b AFPHQEPGFHSPHRNSPSPD- 

C LFLTRSPASTPHIGIVHPQI- 

TTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATCCAGGTG 

3781 + + + + + + 3840 

AAGCGGTAACAAGTGGGGAGCGGGACGGGAGGAAACGGAAGGTGGGGGTGGTAGGTCCAC 

a FAIVHPSPCPPLPSTPTIQV 

b SPLFTPRPALLCLPPPPSRW- 

C RHCSPLALPSFAFHPHHPGG- 

GAGACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTG 

3841 + + + + + + 3900 

CTCTGGGACTCTTCCTGGGACCCTCGAGACCCTTAAACCTCACTGGTTTCCACACGGGAC 
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a ETLRRTLGALGIWSDQRCAL 

b RP*EGPWELWEFGVTKGVPC- 

C DPEKDPGSSGNLE*PKVCPV- 

TACy^CAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTC^^ 

3901 + + + + + + 3960 

ATGTGTCCGCTCCTGGGACGTGGACCTACCCCCAGGGACACCCAGTTTAACCCCCCTCCA 

a YTGEDPAPGWGSLWVKLGGG 

b TQARTLHLDGGPCGSNWGEV- 

C HRRGPCTWMGVPVGQIGGRC- 

GCTGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGAAAAAAAAAAAAAAAAA 

3961 + + 4. + + + 4020 

CGACT^CCCTCATTTTATGACTTATATACTCAAAAAGTCT^AAACTTTT^ 

a AVGVKY*IYEFFSFEKKKKK 

b LWE*NTEYMSFSVLKKKKKK- 

C CGSKILNI*VFQF*KKKKKK- 

AAAAAAAAA 

4021 4029 

TTTTTTTTT 

a K K K - 

b K K - 

c K K - 
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1 

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his tyr arg glu val leu pro leu ala thr phe val arg arg leu 
CAC TAG CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly pro gin gly trp arg leu val gin arg gly asp pro ala ala 
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 60 
phe arg ala leu val ala gin cys leu val cys val pro trp asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arg pro pro pro ala ala pro ser phe arg gin val ser cys 
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 90 
leu lys glu leu val ala arg val leu gin arg leu cys glu arg 
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

110 120 
ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 

130 

tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala 
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 

140 150 
trp gly leu leu leu arg arg val gly asp asp val leu val his 
TGG GGG CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC 

160 

leu leu ala arg cys ala leu phe val leu val ala pro ser cys 
CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG GTG GCT CCC AGC TGC 

170 180 
ala tyr gin val cys gly pro pro leu tyr gin leu gly ala ala 
GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG CTC GGC GCT GCC 

190 

thr gin ala arg pro pro pro his ala ser gly pro arg arg arg 
ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT 
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leu 
CTG 


gly 

GGA 


cys 
TGC 


glu 

GAA 


arg 
CGG 


val 
GTC 


pro 
CCC 


leu 
CTG 


gly 
GGC 


leu 
CTG 


ser 
AGT 


ala 
GCC 


ser 
AGC 


230 
arg 
CGA 


ser 
AGT 


ala 
GCT 


ala 
GCC 


pro 
CCT 


glu 
GAG 


pro 
CCG 


ala 
GCC 


his 
CAC 


pro 
CCG 


260 

gly 

GGC 


arg 
AGG 


val 
GTG 

gly 
GGT 


val 
GTG 

ala 
GCG 


ser 
TCA 

leu 
CTC 


pro 
CCT 
290 
ser 
TCT 


ala 
GCC 

gly 
GGC 


gin 
CAG 


his 
CAC 


his 
CAC 


ala 
GCG 


gly 
GGC 


trp 
TGG 


asp 
GAC 


thr 
ACG 


320 
pro 
CCT 


cys 
TGT 


leu 
CTC 


tyr 
TAG 


ser 
TCC 


ser 
TCA 


gly 
GGC 


leu 
CTC 


ser 
AGC 


ser 
TCT 


350 
leu 
CTG 


arg 
AGG 


glu 
GAG 


thr 
ACC 


ile 
ATC 


phe 
TTT 


leu 
CTG 


arg 
CGC 


arg 
AGO 


leu 
TTG 


380 
pro 
CCC 


arg 
CGC 


leu 
CTG 


phe 
TTT 


leu 
CTG 


glu 

GAG 


leu 
CTG 


val 
GTG 


leu 
CTC 


leu 
CTC 


410 
lys 
AAG 


thr 
ACG 
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ala 
GCC 


trp 
TGG 


asn 
AAC 


his 
CAT 


ser 
AGC 


pro 
CCA 


ala 
GCC 


pro 
CCG 


220 
gly 
GGT 


ala 
GCG 


leu 
CTG 


pro 
CCG 


leu 
TTG 


pro 
CCC 


lys 
AAG 


glu 
GAG 


arg 
CGG 


thr 
ACG 


250 
pro 

CCC 


val 


thr 
ACG 


arg 
CGT 


gly 
GGA 


pro 
CCG 


ser 
AGT 


arg 
AGA 


pro 
CCC 


ala 
arc 


280 
glu 
GAA 


glu 

HA A 


thr 
ACG 


arg 
CGC 


his 

CAC 


ser 
TCC 


his 
CAC 


pro 
CCC 


pro 
CCA 


ser 
TCC 


310 
thr 
ACA 


ser 
TCG 


pro 
CCC 


pro 
CCG 


val 
GTG 


tyr 
TAG 


ala 
GCC 


asp 
GAC 


lys 
AAG 


glu 
GAG 


340 
gin 
CAG 


leu 
CTG 


pro 
CCC 


ser 
AGC 


leu 
CTG 


thr 
ACT 


gly 

GGC 


gly 

GGT 


ser 
TCC 


arg 
AGG 


370 
pro 
CCC 


trp 
TGG 


leu 
CTG 


pro 
CCC 


gin 
CAG 


arg 
CGC 


tyr 
TAG 


leu 
CTT 


gly 
GGG 


asn 
AAC 


400 
his 
CAC 


ala 
GCG 


his 
CAC 


cys 
TGC 


pro 
CCG 


leu 
CTG 


arg 
CGA 



210 

val arg glu ala gly 
GTC AGG GAG GCC GGG 



arg arg arg gly gly 
AGG AGG CGC GGG GGC 

240 

arg pro arg arg gly 
AGG CCC AGG CGT GGC 



gly gin gly ser trp 
GGG CAG GGG TCC TGG 

270 

asp arg gly phe cys 
GAC CGT GGT TTC TGT 



ala thr ser leu glu 
GCC ACC TCT TTG GAG 
300 

pro ser val gly arg 
CCA TCC GTG GGC CGC 



arg pro pro arg pro 
CGG CCA CCA CGT CCC 

330 

glu thr lys his phe 
GAG ACC AAG CAC TTC 



arg pro ser phe leu 
CGG CCC TCC TTC CTA 

360 

ala arg arg leu val 
GCT CGG AGG CTC GTG 



met pro gly thr pro 
ATG CCA GGG ACT CCC 

390 

trp gin met arg pro 
TGG CAA ATG CGG CCC 



gin cys pro tyr gly 
CAG TGC CCC TAG GGG 

420 

ala ala val thr pro 
GCT GCG GTC ACC CCA 
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430 

ala ala gly val cys ala arg glu lys pro gin gly ser val ala 
GCA GCC GGT GTC TGT GCC CGG GAG AAG' GCC CAG GGC TCT GTG GCG 

440 450 
ala pro glu glu glu asp thr asp pro arg arg leu val gin leu 
GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGC CTG GTG CAG CTG 

460 

leu arg gin his ser ser pro trp gin val tyr gly phe val arg 
CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG CGG 

470 480 
ala cys leu arg arg leu val pro pro gly leu trp gly ser arg 
GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG 

490 

his asn glu arg arg phe leu arg asn thr lys lys phe ile ser 
CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC 

500 510 
leu gly lys his ala lys leu ser leu gin glu leu thr trp lys 
CTG GGG AAG CAT GCC AAG CTC TCG CTG CAG GAG CTG ACG TGG AAG 

520 

met ser val arg asp cys ala trp leu arg arg ser pro gly val 
ATG AGC GTG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT 

530 540 
gly cys val pro ala ala glu his arg leu arg glu glu ile leu 
GGC TGT GTT CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG 

550 

ala lys phe leu his trp leu met ser val tyr val val glu leu 
GCC AAG TTC CTG CAC TGG CTG ATG AGT GTG TAC GTC GTC GAG CTG 

560 570 
leu arg ser phe phe tyr val thr glu thr thr phe gin lys asn 
CTC AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC 

580 

arg leu phe phe tyr arg lys ser val trp ser lys leu gin ser 
AGG CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC 

590 600 
ile gly ile arg gin his leu lys arg val gin leu arg glu leu 
ATT GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG 

610 

ser glu ala glu val arg gin his arg glu ala arg pro ala leu 
TCG GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG 

620 630 
leu thr ser arg leu arg phe ile pro lys pro asp gly leu arg 
CTG ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG 

640 

pro ile val asn met asp tyr val val gly ala arg thr phe arg 
CCG ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC 
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650 660 

gl^ lys ^irg ala glu arg leu thr ser arg val lys ala leu 

AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG 

670 

phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu 
TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG 

680 690 
gly ala ser val leu gly leu asp asp lie his arg ala trp arg 
GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC 

700 

thr phe val leu arg val arg ala gin asp pro pro pro glu leu 
ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG 

710 720 
tyr phe val lys val asp val thr gly ala tyr asp thr lie pro 
TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 

730 

gin asp arg leu thr glu val ile ala ser ile ile lys pro gin 
CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG 

740 750 
asn thr tyr cys val arg arg tyr ala val val gin lys ala ala 
AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC 

760 

his gly his val arg lys ala phe lys ser his val ser thr leu 
CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC TCT ACC TTG 

770 780 
thr asp leu gin pro tyr met arg gin phe val ala his leu gin 
ACA GAC CTC CAG CCG TAC ATG CGA CAG TTC GTG GCT CAC CTG CAG 

790 

glu thr ser pro leu arg asp ala val val ile glu gin ser ser 
GAG ACC AGC CCG CTG AGG GAT GCC GTC GTC ATC GAG CAG AGC TCC 

800 810 
ser leu asn glu ala ser ser gly leu phe asp val phe leu arg 
TCC CTG AAT GAG GCC AGC AGT GGC CTC TTC GAC GTC TTC CTA CGC 

820 

phe met cys his his ala val arg ile arg gly lys ser tyr val 
TTC ATG TGC CAC CAC GCC GTG CGC ATC AGG GGC AAG TCC TAC GTC 

830 840 
gin cys gin gly ile pro gin gly ser ile leu ser thr leu leu 
CAG TGC CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC 

850 

cys ser leu cys tyr gly asp met glu asn lys leu phe ala gly 
TGC AGC CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG 

860 870 
ile arg arg asp gly leu leu leu arg leu val asp asp phe leu 
ATT CGG CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG 
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leu 
TTG 


val 
GTG 


thr 
ACA 


pro 
CCT 


his 
CAC 


leu 
CTG 


val 
GTC 


arg 
CGA 


890 
gly 
GGT 


val 
GTC 


lys 
AAG 


thr 
ACA 


val 
GTG 


val 
GTG 


asn 
AAC 


thr 
ACG 


ala 
GCT 


phe 
TTT 


920 
val 
GTT 


gin 
CAG 


gly 

GGC 


leu 
CTG 


leu 
CTG 


leu 
CTG 


asp 
GAT 


ser 
TCC 


ser 
AGC 


tyr 
TAT 


950 
ala 
GCC 


arg 
CGG 


arg 
CGC 


gly 
GGC 


phe 
TTC 


lys 
AAG 


ala 
GCT 


val 
GTC 


leu 
TTG 


arg 
CGG 


980 
leu 
CTG 


lys 
AAG 


asn 
AAC 


ser 
AGC 


leu 
CTC 


gin 
CAG 


thr 
ACG 


leu 
CTG 


gin 
CAG 


ala 
GCG 


1010 
tyr arg 
TAC AGG 


his 
CAT 


gin 
CAG 


gin 
CAA 


val 
GTT 


trp 
TGG 


ser 
TCT 


asp 
GAC 


thr 
ACG 


104C 

ala 

GCC 


ser 
TCC 


ala 
GCA 


gly 
GGG 


met 
ATG 


ser 
TCG 


leu 
CTG 



1070 

ser glu ala val gin 
TCC GAG GCC GTG CAG 



leu thr arg his arg 
CTG ACT CGA CAC CGT 
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880 



leu 
CTC 


thr 
ACC 


his 
CAC 


ala 
GCG 


lys 
AAA 


pro 
CCT 


glu 
GAG 


tyr 
TAT 


gly 

GGC 


cys 
TGC 


phe 
TTC 


pro 
CCT 


val 
GTA 


910 
glu 
GAA 


asp 
GAC 


met 
ATG 

thr 
ACC 


pro 
CCG 

arg 
CGG 


ala 
GCC 

thr 
ACC 


his 
CAC 
940 
leu 
CTG 


gly 

GGC 

glu 

GAG 


thr 
ACC 


ser 
TCC 


ile 
ATC 


arg 
AGA 


ala 
GCC 


gly 
GGG 


arg 
AGG 


asn 
AAC 


970 
met 
ATG 


arg 
CGT 


cys 
TGT 


his 
CAC 


ser 
AGC 


leu 
CTG 


phe 
TTT 


val 
GTG 


cys 
TGC 


thr 
ACC 


1000 
asn ile 
AAC ATC 


phe 
TTT 


his 
CAC 


ala 
GCA 


cys 
TGT 


val 
GTG 


lys 
AAG 


asn 
AAC 


pro 
CCC 


1030 
thr phe 
ACA TTT 


leu 
CTC 


cys 
TGC 


tyr 
TAC 


ser 
TCC 


ile 
ATC 


gly 
GGG 


ala 
GCC 


lys 
AAG 


1060 
gly ala 
GGC GCC 


trp 
TGG 


leu 
CTG 


cys 
TGC 


his 
CAC 


gin 
CAA 


val 
GTC 


thr 
ACC 


tyr 
TAC 


1090 

val 

GTG 


pro 
CCA 



thr phe leu arg thr 
ACC TTC CTC AGG ACC 

900 

val val asn leu arg 
GTG GTG AAC TTG CGG 



glu ala leu gly gly 
GAG GCC CTG GGT GGC 

930 

leu phe pro trp cys 
CTA TTC CCC TGG TGC 

val gin ser asp tyr 
GTG CAG AGC GAC TAC 

960 

ser leu thr phe asn 
AGT CTC ACC TTC AAC 



arg lys leu phe gly 
CGC AAA CTC TTT GGG 

990 

leu asp leu gin val 
CTG GAT TTG CAG GTG 



tyr lys ile leu leu 
TAC AAG ATC CTC CTG 

1020 

leu gin leu pro phe 
CTG CAG CTC CCA TTT 



phe leu arg val ile 
TTC CTG CGC GTC ATC 

1050 

leu lys ala lys asn 
CTG AAA GCC AAG AAC 



ala gly pro leu pro 
GCC GGC CCT CTG CCC 

1080 

ala phe leu leu lys 
GCA TTC CTG CTC AAG 



leu leu gly ser leu 
CTC CTG GGG TCA CTC 
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1100 1110 
arg thr ala gin thr gin leu ser arg lys leu pro gly thr thr 
AGG ACA GCC CAG ACG GAG CTG AGT CGG AAG CTC CCG GGG ACG ACG 



1120 

leu thr ala leu glu ala ala ala asn pro ala leu pro ser asp 
CTG ACT GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC 

1130 1132 
phe lys thr lie leu asp OP 

TTC AAG ACC ATC CTG GAC TGA TGGCCACCCGCCCACAGCCAGGCCGAGAGCAGA 

CACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCC 

AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC 

GGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC 

CAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG 

CTTTTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT 

CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA 

GACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTA 

CACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGC 

TGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGRAAAAAAAAAAAAAAAAAA 

AAAAAAAAAA 
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KPN1 17050 
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SAC1 21083 
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